You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- Create 'scraped_data', 'user_features', 'user_labels' folders in current directory
- To classify users, put dataset (with usernames) in scraped_data
- Execute run.py
- Scraper FeatExt cell ---> output: saved in 'user_features'
- Cleaning feature df cell ---> output: feature dataframe with no nans saved in 'user_features'
- Classify user cell ---> output: original scraped data + labels saved in 'user_labels'
---------------------------
- run_TimelineScraper.py:
- script for scraping user timeline only. Timelines are saved as csv in 'timeline_data' folder.
- run_FeatureExtraction.py:
- script for feature extraction only. Requires timeline data generated by run_TimelineScraper.py. Feature matrix saved as csv in 'user_features' foler.