MP4 | Video: 720p | Duration: 4:14:33 | English | Subtitles: VTT | 570.8 MB
As more data becomes publicly available and attention shifts toward unstructured text, knowing how to clean, process, and analyze that text is increasingly valuable. If you have some experience with Python and an interest in natural language processing (NLP), this course can give you the knowledge you need to tackle complex problems using machine learning. Instructor Derek Jedamski provides a quick summary of basic NLP concepts, covers advanced data cleaning and vectorization techniques, and then takes a deep dive into building machine learning classifiers. In that final part, Derek shows how to build two different types of machine learning models, as well as how to evaluate and test variations of those models.
Topics include:
- What are NLP and NLTK?
- Using regular expressions
- Using stemming and lemmatizing
- Methods to vectorize raw data
- Building and evaluating machine learning classifiers