Emergic: Rajesh Jain's Blog

Text Mining the NYT

August 3rd, 2006

ZDNet writes:

Text mining is a computer technique to extract useful information from unstructured text. And it’s a difficult task. But now, using a relatively new method named topic modeling, computer scientists from University of California, Irvine (UCI), have analyzed 330,000 stories published by the New York Times between 2000 and 2002 in just a few hours. They were able to automatically isolate topics such as the Tour de France, prices of apartments in Brooklyn or dinosaur bones. This technique could soon be used not only by homeland security experts or librarians, but also by physicians, lawyers, real estate people, and even by yourself.

Tags: Software
