WOVe: Incorporating Word Order in GloVe Word Embeddings
Abstract
Word vector representations open up new opportunities to extract useful information from unstructured text. Defining a word as a vector made it easy for the machine learning algorithms to understand a text and extract information from. Word vector representations have been used in many applications such word synonyms, word analogy, syntactic parsing, and many others. GloVe, based on word contexts and matrix vectorization, is an effective vector-learning algorithm. It improves on previous vector-learning algorithms. However, the GloVe model fails to explicitly consider the order in which words appear within their contexts. In this paper, multiple methods of incorporating word order in GloVe word embeddings are proposed. Experimental results show that our Word Order Vector (WOVe) word embeddings approach outperforms unmodified GloVe on the natural language tasks of analogy completion and word similarity. WOVe with direct concatenation slightly outperformed GloVe on the word similarity task, increasing average rank by 2%. However, it greatly improved on the GloVe baseline on a word analogy task, achieving an average 36.34% improvement in accuracy.
Keywords
Full Text:
PDFReferences
Ibrahim, M., Gauch, S., Gerth, T., & Cox, B. (2022). WOVe: Incorporating Word Order in GloVe Word Embeddings. International Journal on Engineering, Science and Technology (IJonEST), 4(2), 124-129.
DOI: https://doi.org/10.46328/ijonest.83
Refbacks
- There are currently no refbacks.
Copyright (c) 2022 International Journal on Engineering, Science and Technology
Abstracting/Indexing
International Journal on Engineering, Science and Technology (IJonEST)-ISSN: 2642-4088
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.