PadhaiTime Logo
Padhai Time

Handling Textual Data

When we have textual data, then there are some different text processing steps which we need to follow:

  • You need to remove HTML tags from data if present
  • Handle Lower and Upper case inconsistency
  • Remove Punctuations
  • Remove Stop words
  • Tokenization
  • Stemming
  • Lemmatization

Once these steps are performed, you will have cleaned data which is ready for Feature Engineering step.

We have discussed all the cleaning steps in Natural Language Processing course, under the Text Cleaning chapter

Bengaluru, India
contact.padhaitime@gmail.com
  • We collect cookies and may share with 3rd party vendors for analytics, advertising and to enhance your experience. You can read more about our cookie policy by clicking on the 'Learn More' Button. By Clicking 'Accept', you agree to use our cookie technology.
    Our Privacy policy can be found by clicking here