View this complete project on my GitHub.

Web Scraping & Natural Language Processing (NLP) with Reddit

This project is an exploration of natural language processing (NLP) models using data gathered from various discussion forums ("subreddits") on reddit.com. The goals were to:

  1. Webscrape selected topics from reddit.com utilizing the Pushshift API.

  2. Clean, tokenize, and apply various machine learning models.

  3. Ascertain if binary classification machine learning models can correctly differentiate between entrepreneurs and leadership via their respective discussion forums.

Previous
Previous

RIADA