Text-based industry classification, BERT, word2vec, doc2vec, latent semantic indexing, cosine similarity, k-means, Gaussian mixture model, deep embedding for clustering