Towards the Development of Typhoon-Related Tweet Classifiers Despite the Sparseness of Strongly-Annotated Data


Click here for the link to the paper.

The paper was presented at the 2017 IEEE Region 10 Conference (TENCON 2017)

TENCON 2017 is expected to bring together researchers, educators, students, practitioners, technocrats and policymakers from across academia, government, industry and non-governmental organizations to discuss, share and promote current works and recent accomplishments across all aspects of electrical, electronic and computer engineering, as well as information technology. Distinguished people will be invited to deliver keynote speeches and invited talks on trends and significant advances in the emerging technologies.


  • Alron Jan Lam
  • Nathaniel Oco
  • Rachel Edita Roxas


We employ a method for weakly annotating typhoon-related tweets in situations where there is only a small strongly-annotated dataset. The idea is that classifiers can be trained on a larger weakly-annotated dataset generated from this smaller dataset. In this research specifically, four binary classifiers were trained to detect the presence of the following themes in tweets: (1) Agonism or Engagement in Debate, (2) Celebrification, (3) Solidaristic, and (4) Tweeting about a Charity Event. The classifiers were then evaluated qualitatively on tweets about the original typhoon (Haiyan in 2013) from which the annotations came from, and on tweets about a typhoon years later (Haima in 2016). This was done by running LDA topic modelling on the positive classifications and qualitatively analyzing the resulting topics for relevance to the themes. Results suggest that the trained classifiers could be useful for the typhoon with which it was trained on, but not necessarily for other typhoons.

Posts created 35

Leave a Reply

Your email address will not be published. Required fields are marked *

Related Posts

Begin typing your search term above and press enter to search. Press ESC to cancel.

Back To Top