EChronicle

eChronicle is a chronicling software that supports the general purpose information archiving using its specialized information models and backends. This site is the home of eChronicle.

  • Development
  • Increasing semantic indexing dictionary Dictionary preparation for semantic indexing Semantic indexing is necessary when the type and size of data is diverse. Without semantic indexing ,by which a user can access or search data by semantic knowledge, a user has to figure out the correct or at least a portion of data values or as well known, keywords, to query the database. EFIM models proposed here can serve the general purpose that unlimited number of heterogeneous sources can be mingled together via events.

  • Dictionay Preparation for Semantic Indexing
  • Semantic indexing is necessary when the type and size of data is diverse. Without semantic indexing ,by which a user can access or search data by semantic knowledge, a user has to figure out the correct or at least a portion of data values or as well known, keywords, to query the database. EFIM models proposed here can serve the general purpose that unlimited number of heterogeneous sources can be mingled together via events.

  • Documentation
  • Abstract This paper presents a new framework for multimedia electronic chronicling systems. Its approach uses events as its driving force for heterogeneous information processing. Specifically, this new approach first separates symbols and data, then puts events between them to make a distinct connection. In addition, this approach provides spatio-temporal-semantic relations networks to map high-level semantic user queries into low-level queries that a machine can compute. The innovative user interfaces are designed for ease of use and are interactive to allow organization of information and a search capability for information retrieval.

  • How does eChronicle work?
  • Just like many personal information chronicling solutions, eChronicle is one of those. However, a number of unique features exist in the eChronicle. Proprietary database solution. Support libraries to encapsulate the back-end solution. Suggest the new programming paradigm.

  • Importing Wikipedia 24th July 2008
  • Old works I recently reworked on the new Wikipedia database archived on 07/24/2008. Old works for old 02/06/2007 Wikipedia English version database can be found at here EChronicle::Importing Wikipedia February 6th 2007. Have any questions? Be sure to read FAQ page which is the collection of actual troubleshooting procedures that I helped many around the world. Send me an email: phkim AT ece DOT gatech DOT edu Download Wikipedia database Detail information to download Wikipedia database is available at Wikipedia database download.

  • Importing Wikipedia February 6th 2007
  • Recent works This work is done on 02/06/2007. For the recent updated work on newer Wikipedia databases are at EChronicle:Wikipedia database. Have any questions? Send me an email: phkim AT ece DOT gatech DOT edu Download MediaWiki First install Apache+PHP+MySQL. English users may visit EasyPHP. Korean users may use APMSETUP. Note! MySQL 5.0.22 passed this test. MySQL 5.1 does not passed the test. It does not handle the Wikipedia text table properly.

  • Multimedia Sample Preparation
  • Scenario Capture videos on the road and display related information. Hardware DV Cam video GPS Laptop for GPS recording and/or DV cam direct capture Software HyperTerminal (Windows software to capture GPS text streams) GPS USB interface Serial Emulator should be pre-installed. Download DeLorme Serial Emulation Driver for USB Earthmate® GPS and Earthmate® GPS LT-20 Receiver. This drive will create a number of virtual serial ports. Choose a port supporting the NMEA (National Marine Electronics Association) data format.

  • OpenNLP
  • Original developers refer that OpenNLP is an organizational center for open source projects related to natural language processing. Its primary role is to encourage and facilitate the collaboration of researchers and developers on such projects. In my personal interests, I was very interested in their package, sentence detector. On the Web, you may find many open sources for tokenizer and parsers for one sentence but not sentence detector that will parse sentences from a given paragraph.

  • Why is this necessary?
  • Limitations in existing storage solutions RDBMS RDBMS is a short for Relational Database Management System. They most follow the SQL-99 standard. A number of limitations defer the eChronicle style storage system. Arrays violate the rules of First Normal Form (1NF) required for a relational database, which say that the tables have no repeating groups in any column. A repeating group is a data structure that is not scalar; examples of repeating groups include linked lists, arrays, records, and even tables within a column.

  • Wikipedia database
  • This page includes pretty old works. Just refer it to get some ideas :) Old works I have been working on the Wikipedia database dump for several years. Belows are the list of previous works. Regretfully, Wikipedia lost their old database. So the source data is not available any more. Wikipedia dump, 07/24/2008: EChronicle::Importing Wikipedia July 24th 2008 Wikipedia dump, 02/06/2007: EChronicle::Importing Wikipedia February 6th 2007 Have any questions?

  • Wikipedia database FAQ
  • Importing time Hi, just wanted to ask: how long do these operations take (and on which hardware)? I want to know when it’s taking way too long… Anonymous Answer Hello Anonymous, Such time factors depend heavily on your system configuration. For your reference, it took me less than one day with a computer having 2.4GHz Intel P4 CPU, 1G memory, 250GB 5400rpm hard disk under Windows XP SP2, and MySQL MyISAM configurations.

  • Wikipedia raw text conversion
  • Wikipedia is a big database occupying over pure database table size, 56GB, plus media files (> 70 GB). Wikipedia texts are written using Wiki syntax. Hence to utilize its contents for information processing, this section will illustrate the step to convert Wiki texts into raw texts. The below regular expressions are tested on Tropicsoft regular expression libraries. Regular expression in C++ format (Most recent) m_sWikiRegExps[WIKI_ELEMENT_REDIRECT] = _T("^\\#(REDIRECT|redirect).?\[{2}(.+)\]{2}"); m_sWikiRegExps[WIKI_ELEMENT_GALLERY] = _T("<(gallery|GALLERY)\\b[^>]*>(.*?)</\\1\\b.*?>"); m_sWikiRegExps[WIKI_ELEMENT_HEADING] = _T("^[=]+([^=]+)?