謝舒凱 Graduate Institute of Linguistics
Associate Professor, National Taiwan University
謝舒凱 Aber 德國杜賓根大學計算語言學博士|中研院語言學研究所博士後研究員| 台灣大學語言學研究所副教授
施孟賢 Simon 台灣大學語言學博士班
張瑜芸 Taco 台灣大學語言學博士班
Biological signals, music, images, video, customer reviews, webpages, medical records, software, game logs, social networks, environmental signals, astro-data, neuron spikes, etc.
This frontier is expanding vastly thanks to new developments in mathematical modelling, algorithms, data management and computing infrastucture. It is having a profound impact not only in science and medicine, but also in e-commerce, marketing, humanities and society at large. Inference and learning with massive datasets is also the key ingredient of the intelligent machines of the future.
文本數據分析：（DS 的一支）利用 NLP + ML 對於文本數據做各種預測與應用
This course will provide an introduction to this exciting growing cross-disciplinary field. It will teach the basic principles and skills required for analysing textual data in a programmable way: finding linguistic patterns, dimensionality reduction, clustering, classification and prediction. Students will also have the opportunity of learning R and command-line programming.