This page contains information about projects of the applied inductive learning course.

Planning

Tutorial October 1, 2015 B28, 2.93
Project October 25, 2015 23:59 GMT+2 Project 1 - Classification algorithms, sources
Project November 22, 2015 23:59 GMT+2 Project 2 - bias and variance analysis (updated Nov 13) Deadline delayed to November 22.
Project November 10 Project 3 - Taxi challenge
Project December 14, 2015 23:59 GMT+1 Project 3 - Submission of the challenge report
Project December 17 (date to be confirmed) Project 3 - Oral presentation

Report submission

Project 3

The third project is organized in the form a challenge, where you will compete against each other. We will provide you with some training data related to a given supervised learning task (predicting the destination of taxi trips) and some test data with unknown labels in order respectively to train your model and to validate it. You can use any techniques and softwares you want to build the best possible model from the training data. During the course of the project, you will be allowed to submit your predictions on the test data and an intermediate ranking of the different teams will be provided according to their current best scores on the public leaderboard. At the closure of the challenge, the final ranking will be established according to the best score of each team on the held out test data (You should thus be careful not to overfit the first half of the test data!).

To handle this challenge, we use the Kaggle platform. You can access the competition at this address: Taxi challenge.

To join, each group member first needs to create an account on Kaggle. Then, one group member needs to create a team (option "My Team" in the Dashboard at the left) and invite other group members to join his team. Submissions to the challenge should only be introduced through your team.

At the end of the challenge, we ask you also to write a report that describes the different steps of your approach and your main results. You should also send us your source code. The report must at least contain:

Instructions to submit your project are the same as for the other projects.

Project 3 - oral presentation

Oral presentations of your solutions will be organised on the last course slot on December, 17 (date to be confirmed). Detailed instructions about these presentations will be provided in due time.

More ressouces

FAQ

Project 1

Which Python version can I use for the project?
You can use either Python 2.7, 3.3 or 3.4. In order to run smoothly your code with Python 2 and 3, you can avoid Python 2+ or 3+ only features and add the following import to the top of the file:
# Only py3 / so that 2 / 3 = 0.66..
from __future__ import division
# Only py3 string encoding
from __future__ import unicode_literals
# Only py3 print
from __future__ import print_function
How can I plot finer boundary plots?
To refine the mesh used for the plot, set the mesh_step_size from plot_boundary to a lower value.
On linux and MacOS, how can I know that I am using the right Python interpreter?
You can simply do
$ which python
What is counted in the 5 pages report?
Everything from the first page to the last page of your pdf report. Appendix are also counted in the pages count. The report must be in A4 page size, single column format and appropriate typography, (e.g. a font size of 11-12pt with appropriate spacing). Figures should also be of appropriate size and design for a scientific report.