Modeling Report and Plan for Testing
Different combinations of parameters should be experimented to develop the best machine learning model possible, combining, for
example, different algorithms, hyperparameters or sets of variables.
Two types of models should be presented:
The presentation of the modeling results should include:
Your report should include a description of the plan for testing, namely:
Two types of models should be presented:
- At least one simple, interpretable model, for instance, a shallow decision tree, where the model output is interpretable. The goal of this model is to verify that the pipeline is working as expected, and the modelling results make sense.
- More complex models, such as ensemble models, which should have a better performance than the simple model.
The presentation of the modeling results should include:
- Description of the different algorithms, hyperparameters and features you experimented with
- Model evaluation visualizations, comparing different models
- Selection of the best model, justifying your choice
- Definition of the groups that may be differently affected by your model, and a comparison of the best model performance for the different groups
- Future work (e.g., what would you like to had developed if you had the time).
Your report should include a description of the plan for testing, namely:
- Who are the end-users of your model?
- How would you design the interface for deploying your model?
- Which study design would you use to evaluate the adoption of your system by its end-users?
- Which measures would you use to evaluate the adoption of your system by its end-users?
- Who would participate in your pilot study and why?
- How would you prepare your pilot implementation (e.g., is training needed?)
- How many data points/ for how long should the pilot be running?