Experimenting with Latent Semantic Analysis
Realizing that the Fall semester was fast approaching, I hurried this weekend to finish my final paper for Prof. Black's class. I mentioned this paper here on the blog last month; it uses semantic analysis as a way of categorizing texts (with TCR as the text-base). The paper is attached here as a Word file if you are interested.
Abstract: The aim of this paper is to test both the practical use value and the psychological underpinnings of Latent Semantic Analysis (LSA), which is a statistical theory and method for extracting and representing the contextual meaning of words. To test the practical use value, we will use LSA to analyze a large corpus of articles within a particular discourse and ask: can LSA decide which category each article goes in? Is LSA able to categorize as well as a human editor? To test LSA's ability to simulate psychological processes, we will experiment with Kintsch, Patel and Ericsson's (1999) hypothesis that the semantic space created by LSA is similar to an expert's Long-Term Working Memory (LTWM).
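For readers who have not seen LSA in action, here is a minimal sketch of the categorization idea, not the procedure used in the paper. It builds a term-document count matrix, reduces it to a two-dimensional semantic space, and assigns an unlabeled article to the category whose labeled articles lie closest by cosine similarity. The toy corpus, the category names, the function names (`lsa_doc_vectors`, `categorize`) and the choice of k=2 are all assumptions made for illustration. To avoid external libraries, the reduction uses power iteration on the document-by-document matrix, which gives the same document coordinates as a truncated SVD.

```python
import math

# Hypothetical toy corpus standing in for a real article collection.
docs = [
    "students learn classroom teacher",
    "teacher grades students school classroom",
    "students use computer software network",
    "network software computer hardware",
    "school teacher helps students learn",  # the article to categorize
]
labels = ["education", "education", "technology", "technology", None]


def top_eigenpairs(G, k, iters=500):
    """Top-k eigenpairs of a symmetric PSD matrix (power iteration + deflation)."""
    n = len(G)
    G = [row[:] for row in G]  # work on a copy
    pairs = []
    for _ in range(k):
        v = [1.0 / math.sqrt(n)] * n
        for _step in range(iters):
            w = [sum(G[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            v = [x / norm for x in w]
        lam = sum(v[i] * G[i][j] * v[j] for i in range(n) for j in range(n))
        pairs.append((lam, v))
        # Remove the component just found so the next pass finds the next one.
        for i in range(n):
            for j in range(n):
                G[i][j] -= lam * v[i] * v[j]
    return pairs


def lsa_doc_vectors(docs, k=2):
    """Return one k-dimensional latent vector per document."""
    vocab = sorted({w for d in docs for w in d.split()})
    # Term-document count matrix A: rows are terms, columns are documents.
    A = [[d.split().count(w) for d in docs] for w in vocab]
    n = len(docs)
    # G = A^T A; its eigenvectors are the right singular vectors of A and
    # its eigenvalues are the squared singular values.
    G = [[sum(row[i] * row[j] for row in A) for j in range(n)] for i in range(n)]
    pairs = top_eigenpairs(G, k)
    return [[math.sqrt(max(lam, 0.0)) * v[j] for lam, v in pairs] for j in range(n)]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def categorize(docs, labels, k=2):
    """Assign each unlabeled document to the nearest category centroid."""
    vecs = lsa_doc_vectors(docs, k)
    centroids = {}
    for name in {l for l in labels if l is not None}:
        members = [vecs[i] for i, l in enumerate(labels) if l == name]
        centroids[name] = [sum(col) / len(members) for col in zip(*members)]
    return {
        i: max(centroids, key=lambda name: cosine(vecs[i], centroids[name]))
        for i, l in enumerate(labels)
        if l is None
    }


if __name__ == "__main__":
    print(categorize(docs, labels))
```

In this sketch the unlabeled article shares most of its vocabulary with the two education articles, so it should land in that category. A real test against a human editor would need a far larger corpus, term weighting, and many more dimensions than two.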