This skill test was designed to test your knowledge of Natural Language Processing. If you missed the skill test, here are the questions and solutions.
A) 0 B) 25 C) 50 D) 75 E) 100
Solution: (A)
LDA here stands for latent Dirichlet allocation, not linear discriminant analysis, and it is an unsupervised learning model.
A) 5, 5, 2 B) 5, 5, 0 C) 7, 5, 1 D) 7, 4, 2 E) 6, 4, 3
Solution: (D)
Nouns: I, New, Delhi, Analytics, Vidhya, Delhi, Hackathon (7)
Verbs: am, planning, visit, attend (4)
Words with frequency count greater than 1: to, Delhi (2)
Hence option D is correct.
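The frequency part of this solution can be checked with a few lines of Python. This is only a sketch: the sentence below is an assumption, reassembled from the words listed in the solution, and the helper name repeated_words is hypothetical. Part-of-speech counts are not computed here, since they would need a tagger.

```python
from collections import Counter

# Assumed sentence, reassembled from the words listed in the solution;
# the original question text may differ.
SENTENCE = "I am planning to visit New Delhi to attend Analytics Vidhya Delhi Hackathon"


def repeated_words(sentence):
    """Return the words that occur more than once, in sorted order."""
    counts = Counter(sentence.split())  # whitespace tokenization, case-sensitive
    return sorted(word for word, count in counts.items() if count > 1)


if __name__ == "__main__":
    words = repeated_words(SENTENCE)
    print(words, len(words))
```

Under that assumed sentence, only "to" and "Delhi" repeat, which matches the third number in option D.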
11) In a corpus of N documents, one document is randomly picked.
Which of the following is correct with regard to a document-term matrix?
A) Only 1 B) Only 2 C) Only 3 D) 1 and 2 E) 2 and 3 F) 1, 2 and 3
Solution: (D)
Statements 1 and 2 are correct: stopword removal decreases the number of features in the matrix, and normalization of words removes redundant features. Converting all words to lowercase also decreases the dimensionality.
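The effect of each preprocessing step on the number of columns in a document-term matrix can be illustrated with a small sketch. Everything here is an assumption made for illustration: the toy corpus, the stopword list, and the crude_stem helper, which is a very rough stand-in for real stemming or lemmatization.

```python
import re

# Hypothetical toy corpus and stopword list, chosen only for illustration.
DOCS = [
    "The cats are playing in the garden",
    "A cat plays in the Garden",
    "the dog played with The Cats",
]
STOPWORDS = {"the", "a", "are", "in", "with"}


def crude_stem(word):
    """Strip a few common suffixes; a rough stand-in for word normalization."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def vocabulary(docs, lowercase=False, remove_stopwords=False, normalize=False):
    """Return the sorted set of terms, i.e. the columns of the matrix."""
    vocab = set()
    for doc in docs:
        for token in re.findall(r"[A-Za-z]+", doc):
            if lowercase:
                token = token.lower()
            if remove_stopwords and token.lower() in STOPWORDS:
                continue
            if normalize:
                token = crude_stem(token)
            vocab.add(token)
    return sorted(vocab)


def document_term_matrix(docs, vocab, **options):
    """Count, per document, how often each vocabulary term occurs."""
    rows = []
    for doc in docs:
        terms = []
        for token in re.findall(r"[A-Za-z]+", doc):
            if options.get("lowercase"):
                token = token.lower()
            if options.get("remove_stopwords") and token.lower() in STOPWORDS:
                continue
            if options.get("normalize"):
                token = crude_stem(token)
            terms.append(token)
        rows.append([terms.count(term) for term in vocab])
    return rows


if __name__ == "__main__":
    print(len(vocabulary(DOCS)))                  # raw tokens
    print(len(vocabulary(DOCS, lowercase=True)))  # after lowercasing
    print(len(vocabulary(DOCS, lowercase=True, remove_stopwords=True)))
    print(len(vocabulary(DOCS, lowercase=True, remove_stopwords=True, normalize=True)))
```

On this toy corpus the vocabulary shrinks at every step, from 15 raw terms to 12 after lowercasing, 7 after stopword removal, and 4 after normalization. The exact numbers depend entirely on the assumed corpus and helpers; the point is only that each step reduces the number of columns.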