Search & Index
At the conception of the ENAWU project, searching across multiple websites holding different types of data - some freely available online, and some behind secure user or membership restrictions - was a challenge that seemed likely to require either extremely heavy programming or considerable expenditure. However, the solution to the first challenge - a text- or context-oriented cross-platform search - was close at hand.
Our IT development team carried out considerable research into free, open source and proprietary solutions (the latter running into thousands of dollars per year) that would be capable of:
Eventually, we decided on the Google Custom Search application - despite the inherent risks and cynical criticisms of Google's overall impact on our internet experience these days.
We appreciate and respect that, for some of our partners, the Google solution appears too simple. While we continue to try to address that concern, there were several reasons for our choice:
This leads us to the more complicated and challenging requirement, both for our own organisations (Arab West Report, CIDT, CAWU) and for several others: not only building an indexing system, but then using that indexing system to give ENAWU users an advanced, index (topic) based search capability.
Indexing & Classification
As mentioned in the section about the Arab West Report index, there are many ways in which data can be classified and indexed these days. In addition to traditional systems such as the Library of Congress Classification (LCC) and Dewey Decimal Classification (DDC), which have been adapted over the years from well-established bibliographical systems for use in computer applications, other systems have been designed with a different focus, such as making information about data (metadata) more easily accessible to a variety of applications that read, mine, search or manage data. The MARC Standards are a good example of this.
At the moment, thanks to the funding of ENAWU Phase 1, we are researching and developing a suitable index for ENAWU around our own custom index, which has been developed over 10 years to make our unique archive of more than 20,000 texts accessible. At the same time we are mapping this index to both the LCC and DDC classification systems. Below is a snapshot of the AWR index as it is today (click on the image to see it in the context of the website). This work is not yet complete, but we are very happy with the progress being made.
The other challenge associated with developing the index is to ensure that all of the data is properly indexed, mapped and cross-referenced, and that it uses the same names, translations, etc. throughout. So, in addition to our own work, we are also exploring specialist service providers who may be able to convert, cross-map and 'clean up' our existing data archive - itself a massive task.
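To make the idea of cross-mapping and name clean-up more concrete, here is a minimal sketch in Python of how a custom topic index might be mapped to LCC and DDC classes while variant spellings are folded into one canonical name. It is not the ENAWU implementation. The topic names, the alias table and the helper names (`Crosswalk`, `normalise`, `classify`) are all hypothetical, and the class codes are illustrative values that would need to be checked against the official LCC and DDC schedules.

```python
from dataclasses import dataclass
import unicodedata


@dataclass(frozen=True)
class Crosswalk:
    """One custom index topic and the library classes it maps to."""
    topic: str  # canonical topic name in the custom index (hypothetical)
    lcc: str    # Library of Congress class (illustrative, verify before use)
    ddc: str    # Dewey Decimal class (illustrative, verify before use)


def normalise(name: str) -> str:
    """Fold case, accents and extra whitespace so name variants compare equal."""
    decomposed = unicodedata.normalize("NFKD", name)
    without_marks = "".join(
        ch for ch in decomposed if not unicodedata.combining(ch)
    )
    return " ".join(without_marks.casefold().split())


# Hypothetical entries; real topics would come from the custom index itself.
CROSSWALK = [
    Crosswalk("Christianity", lcc="BR", ddc="230"),
    Crosswalk("Islam", lcc="BP", ddc="297"),
]

# Hypothetical variant names that should resolve to one canonical topic.
ALIASES = {
    "Christian faith": "Christianity",
    "Islām": "Islam",
}


def build_lookup(entries, aliases):
    """Index every canonical name and alias under its normalised form."""
    by_topic = {entry.topic: entry for entry in entries}
    lookup = {normalise(entry.topic): entry for entry in entries}
    for alias, canonical in aliases.items():
        lookup[normalise(alias)] = by_topic[canonical]
    return lookup


LOOKUP = build_lookup(CROSSWALK, ALIASES)


def classify(name: str):
    """Return the crosswalk entry for a topic name, or None if it is unmapped."""
    return LOOKUP.get(normalise(name))
```

With this sketch, `classify("  ISLĀM ")` and `classify("Islam")` resolve to the same entry, while an unmapped name returns `None`, so it can be flagged for manual review instead of being silently misfiled.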
In ENAWU Phase 2 we are looking to do two things with respect to the index: