Publication Date

Fall 2017

Degree Type

Master's Project

Degree Name

Master of Science (MS)

Department

Computer Science

Abstract

Many words have multiple meanings. For example, “plant” can mean a type of living organism or a factory. Being able to determine the sense of such words is very useful in natural language processing tasks, such as speech synthesis, question answering, and machine translation. For the project described in this report, we used a modular model to classify the sense of words to be disambiguated. This model consisted of two parts: The first part was a neural-network-based language model to compute continuous vector representations of words from data sets created from Wikipedia pages. The second part classified the meaning of the given word without explicitly knowing what the meaning is. In this unsupervised word sense determination task, we did not need human-tagged training data or a dictionary of senses for each word. We tested the model with some naturally ambiguous words, and compared our experimental results with the related work by Schütze in 1998. Our model achieved similar accuracy as Schütze’s work for some words.

Recommended Citation

Liu, Qiao, "Word Sense Determination from Wikipedia Data Using Neural Networks" (2017). Master's Projects. 567.
DOI: https://doi.org/10.31979/etd.8np8-7kk9
https://scholarworks.sjsu.edu/etd_projects/567

Download

Included in

Computer Sciences Commons

COinS

DOI

https://doi.org/10.31979/etd.8np8-7kk9

Master's Projects

Word Sense Determination from Wikipedia Data Using Neural Networks

Publication Date

Degree Type

Degree Name

Department

Abstract

Recommended Citation

Included in

DOI

Search

Browse All

Links

Master's Projects

Word Sense Determination from Wikipedia Data Using Neural Networks

Author

Publication Date

Degree Type

Degree Name

Department

Abstract

Recommended Citation

Included in

Share

DOI

Search

Browse All

Links