WOVe: Incorporating Word Order in GloVe Word Embeddings

Authors

DOI:

https://doi.org/10.46328/ijonest.83

Keywords:

Word embeddings, Vector learning, Attention mechanisms

Abstract

Word vector representations open up new opportunities to extract useful information from unstructured text. Defining a word as a vector made it easy for the machine learning algorithms to understand a text and extract information from. Word vector representations have been used in many applications such word synonyms, word analogy, syntactic parsing, and many others. GloVe, based on word contexts and matrix vectorization, is an effective vector-learning algorithm. It improves on previous vector-learning algorithms. However, the GloVe model fails to explicitly consider the order in which words appear within their contexts. In this paper, multiple methods of incorporating word order in GloVe word embeddings are proposed. Experimental results show that our Word Order Vector (WOVe) word embeddings approach outperforms unmodified GloVe on the natural language tasks of analogy completion and word similarity. WOVe with direct concatenation slightly outperformed GloVe on the word similarity task, increasing average rank by 2%.  However, it greatly improved on the GloVe baseline on a word analogy task, achieving an average 36.34% improvement in accuracy.

Author Biographies

Mohammed Salah Ibrahim, University of Arkansas

Mohammed Ibrahim, Ph.D.Department of Computer Science and Computer Engineering University of Arkansas, Fayetteville, Ar E-mail: msibrahi@uark.edu, mohammad.alaubeedy@gmail.com 

Susan Gauch

Dr. Susan Gauch Professor Computer Science & Computer Engineering University of Arkansas

Tyler Gerth, University of Arkansas

Tyler GerthUniversity of Arkansas, US, tdgerth@uark.edu

Brandon Cox, University of Arkansas

Brandon CoxUniversity of Arkansas, US, bscox@uark.edu

References

Ibrahim, M., Gauch, S., Gerth, T., & Cox, B. (2022). WOVe: Incorporating Word Order in GloVe Word Embeddings. International Journal on Engineering, Science and Technology (IJonEST), 4(2), 124-129.

Downloads

Published

2022-12-11

Issue

Section

Engineering