Social Media Corpus in Taiwan (SoMe) is a LOPE project building a large-scale Taiwanese social media corpus for discourse and computational research.
Project Overview
SoMe draws its data from PTT and Dcard to support research on online Chinese discourse, constructional patterns, and social media language use in Taiwan. The corpus is designed to serve both qualitative discourse analysis and computational representation learning.
