Other articles

  1. 15 Years of News – Analyzing CNN Transcripts: Visualizing Topics

    [caption id="attachment_1080" align="alignleft" width="150"]chart-scatter High-level visualization of topics in CNN’s corpus[/caption]

    By extracting several topics from our news corpus, we gained a 10,000 feet view of corpus. We were able to outline many trends and events, but it took a bit of digging. This …

    read more
  2. A Map of the Geographic Structure of Wikipedia Topics

    [caption id="attachment_685" align="alignleft" width="150"]Wikipedia Topic 260 Mountains, peaks, summits, etc.[/caption]

    A large number of Wikipedia articles are geocoded. This means that when an article pertains to a location, its latitude and longitude are linked to the article. As you can imagine, this can be useful to generate insightful …

    read more