Web-Based Programming for Real-Time News Acquisition

Report No. ARL-MR-671
Authors: Andrew M. Neiderer; John Richardson
Date/Pages: September 2007; 36 pages
Abstract: This report describes a Web 2.0 application developed at the U.S. Army Research Laboratory in support of its Real-Time News Analysis (RTNA) project. The application uses the Google, Inc. AJAX Search application programming interface (API) to acquire data and then formats the resulting data for analysis. News stories on a specified topic (e.g., terrorist bombing) are gathered from public sources by a JavaScript function embedded in an Extensible HyperText Markup Language (XHTML) document. The content of selected elements is then extracted, or scraped, from the XHTML. The graphical user interface allows the user to choose up to 10 words and/or phrases and to explicitly exclude certain terms. Presently, the data sources are determined by Google News and can be user-specified in a Google Web service. A Google gadget for Maps has been added for geographic visualization of location, and additional searchers for Google Video, Blog, and Book have been tested and can easily be added to the search controller. The application also allows for integration of asynchronous JavaScript and XML (AJAX) technology, including Java servlets for requesting data and Java Server Pages for the responses.
Distribution: Approved for public release
If you are visually impaired or need a physical copy of this report, please contact DTIC.

Last Update / Reviewed: September 1, 2007
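
The term selection and scraping steps described in the abstract can be sketched roughly as follows. This is a minimal illustration in Python, not the report's JavaScript implementation. The helper names build_query and extract_text, the "title" class name, the sample markup, and the query syntax (quoted phrases, a leading minus sign for exclusion) are all assumptions made for the example; only the limit of 10 words and/or phrases comes from the abstract.

```python
import xml.etree.ElementTree as ET

MAX_TERMS = 10  # limit stated in the abstract: up to 10 words and/or phrases

# Namespace that XHTML elements carry when parsed as XML.
XHTML_NS = "{http://www.w3.org/1999/xhtml}"


def build_query(include, exclude=()):
    """Combine search terms into one query string (hypothetical helper)."""
    if len(include) > MAX_TERMS:
        raise ValueError(f"at most {MAX_TERMS} words or phrases are allowed")

    def quote(term):
        # Quote multi-word phrases so they are treated as a unit.
        return f'"{term}"' if " " in term else term

    parts = [quote(t) for t in include]
    # Assumption: a leading minus sign marks a term for exclusion.
    parts += ["-" + quote(t) for t in exclude]
    return " ".join(parts)


def extract_text(xhtml, tag, css_class):
    """Return the text of every <tag class="css_class"> element (hypothetical helper)."""
    root = ET.fromstring(xhtml)
    return [
        "".join(el.itertext()).strip()  # include text of nested elements
        for el in root.iter(XHTML_NS + tag)
        if el.get("class") == css_class
    ]


# Made-up markup standing in for a page of search results.
sample = (
    '<html xmlns="http://www.w3.org/1999/xhtml"><body>'
    '<div class="title">Blast in <b>city</b> center</div>'
    '<div class="other">unrelated</div>'
    "</body></html>"
)

print(build_query(["terrorist bombing", "report"], exclude=["movie"]))
print(extract_text(sample, "div", "title"))
```

Parsing with an XML parser works here only because XHTML is well-formed XML; pages that are not well-formed would need a more tolerant HTML parser.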