Nov 30, 2021 | Technology
Data cleaning is the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. The data format is not always tabular. As we enter the era of big data, data comes in an extensively diverse...
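As a rough illustration of the definition in this teaser, here is a minimal sketch in plain Python that handles three of the problem types named there: badly formatted, incomplete, and duplicate data. The `clean_records` helper, the `name` and `email` fields, and the sample rows are all hypothetical and are not taken from the article.

```python
def clean_records(records):
    """Normalize formatting, drop incomplete or invalid rows, remove duplicates."""
    seen_emails = set()
    cleaned = []
    for rec in records:
        # Incorrectly formatted data: trim whitespace and normalize case.
        name = (rec.get("name") or "").strip().title()
        email = (rec.get("email") or "").strip().lower()
        # Incomplete data: skip rows that lack a required field.
        if not name or not email:
            continue
        # Incorrect data: a deliberately crude validity check (assumption).
        if "@" not in email:
            continue
        # Duplicate data: keep only the first row seen for each email.
        if email in seen_emails:
            continue
        seen_emails.add(email)
        cleaned.append({"name": name, "email": email})
    return cleaned


# Hypothetical sample rows, invented for this sketch.
raw = [
    {"name": "  ada lovelace ", "email": "ADA@example.com"},
    {"name": "Ada Lovelace", "email": "ada@example.com"},    # duplicate
    {"name": "Alan Turing", "email": None},                   # incomplete
    {"name": "Grace Hopper", "email": "grace.example.com"},   # invalid
]
cleaned = clean_records(raw)
print(cleaned)
```

Only the first row survives here. Real pipelines usually log or repair rejected rows rather than silently dropping them.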
Nov 11, 2021 | Technology
spaCy is an open-source Python library used for Natural Language Processing (NLP). Unlike NLTK, which is widely used in research, spaCy focuses on production usage. Industrial-strength NLP: spaCy is a library for advanced NLP in Python and Cython. As of now, this is the...
Oct 29, 2021 | Technology
Stemming and lemmatization are text/word normalization techniques widely used in text pre-processing. They reduce words to their root form. Here is an example: Let’s say you have to train the data for classification and you are choosing any...
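To make the difference between the two techniques concrete, here is a toy sketch in plain Python. It is not the article's example and not a real stemmer or lemmatizer: the suffix list and the tiny lemma dictionary are assumptions chosen only to show that a stem is produced by chopping suffixes (and may not be a real word), while a lemma is looked up as a dictionary form.

```python
# Hypothetical suffix list and lemma table, for illustration only.
SUFFIXES = ("ing", "ed", "es", "s")
LEMMAS = {"studies": "study", "studying": "study", "better": "good"}


def toy_stem(word):
    """Strip the first matching suffix, keeping at least three characters."""
    word = word.lower()
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def toy_lemmatize(word):
    """Look the word up in a small dictionary; fall back to the word itself."""
    word = word.lower()
    return LEMMAS.get(word, word)


for w in ("studies", "studying", "better"):
    print(w, toy_stem(w), toy_lemmatize(w))
```

With these toy rules, "studies" stems to "studi" but lemmatizes to "study", and "better" is left alone by the stemmer while the lookup maps it to "good". Production code would normally rely on an established NLP library instead of hand-written rules.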
Oct 15, 2021 | Technology
Machine Learning, Deep Learning, and Artificial Intelligence are popular buzzwords today. Artificial Intelligence (AI) is the branch of computer science that deals with giving machines artificial intelligence so that they are able to think, act and...
Oct 14, 2021 | Technology
We talked about word embeddings a bit in our last article, using word2vec. Word embeddings are one of the most powerful tools available to NLP developers today, and most NLP tasks will require some kind of word embedding in one of the levels. Thus, it is important to...