This repo contains notebooks for building a recommender system for influencers and artists/tracks on Groover dataset.
-
01_exploration_and_preprocessing.ipynbprovides quick exploration and preprocessing of categorical data -
02_first_models.ipynbbuilds benchmark with random, naive and LGBM models, and first Neural Networks using Keras -
03_final_models.ipynbbuilds Neural Networks models incorporating textual data fromtrack_info
NB: plotly plots do not show on github, please run notebooks to display them.
- install
requirements.txtin a virtualenv:pip install -r requirements.txt - create
./data/raw/and./data/preprocessed/directories - put raw data in
./data/raw/ - install
graphviz:sudo apt-get install graphviz - run
01_exploration_and_preprocessing.ipynbfor preprocessing (preprocessed data is created and put in./data/preprocessed/directory) - run
02_first_models.ipynbto train models and benchmark - download and unzip
cc.fr.300.vecfrench pre-trained word embeddings from this link and put it in./ - run
03_final_models.ipynbto train final models