Article preview
Stata is a complete, integrated statistical software package created by StataCorp LP (www.stata.com). It provides a wide range of statistical analysis, data management, and graphics capabilities. Version 13, released in June 2013, added many new features, including a long string data type that can store documents of up to 2 billion characters alongside numerical and categorical data. One could thus create a statistical database containing journal abstracts, news transcripts, patents, incident reports, customer feedback, interviews, and so on. WordStat for Stata was created to allow Stata 13 and Stata 14 users running under Windows to apply text analytics techniques to any string variable stored in a Stata data file. WordStat combines natural language processing, content analysis, and statistical techniques to quickly extract topics, patterns, and relationships in large a
………………………………