Statistical Dictionary Models for Mining Chinese Texts
Posted: October 28, 2014

Speaker: Ke Deng (邓柯)

Location: Room 105

Time: Friday, October 31, 2014, 14:00-15:00

Host:

Abstract:

With the explosive growth of the internet and digital technologies, large quantities of digitized text data can be easily collected. There is therefore great appeal in developing text mining tools that automatically extract information from these data and create new knowledge. Because natural languages are very noisy and the data size is huge, it is not productive to base such tools on precise linguistic analysis. Instead, as many have seen, methods based on statistical models have great advantages, even if they miss some subtleties in the text. In this talk, I will introduce a series of novel approaches to modeling and mining Chinese text data: a "word dictionary model" to discover patterns of serial units such as words and phrases and to achieve text segmentation, a "theme dictionary model" to recognize long-range associations among text units, and a "concept network" to incorporate domain knowledge into text analysis. Used separately or jointly, these approaches can address many important practical problems.

About the speaker:

Dr. Ke Deng received his B.S. in mathematics from Peking University in 2003 and his Ph.D. from the Department of Probability and Statistics, Peking University, in 2008. Before joining MSC, he was a research associate in the Statistics Department at Harvard University. His research interests include Bayesian methodology, sequential Monte Carlo, bioinformatics, statistical genetics, text mining, network tomography, social sciences, and Chinese medicine. He will be a tenure-track assistant professor at MSC from September 2013.


Address: 5268 Renmin Street, Changchun, Jilin Province | Postal code: 130024 | Tel: 0431-85099589 | Fax: 0431-85098237

