Pankaj Kumar
6 min readAug 29, 2020

Clustering Names from wikipedia article Using Word Embedding and K-Mean Clustering

Photo by Bee Balogun on Unsplash

In this article I am trying to cluster names from the names extracted from a wikipedia article. I’ll be using K-mean clustering and the distance between names will be calculated based on the word embedding vectors provided by spacy. In an earlier article we extracted names from wiki page and used spacy named entity recognizer technique to identify…

Pankaj Kumar

MS Data Science SMU TX. Pursuing MSc Financial Engg. At WQU.Interest in Algos, Discovering Trends fm data. Methodical, conven/non-conven. Investigation of data.