Thimal Jayasooriya

Main page


About me


UniLiteral


NLP toolkits


List of links


Hi. I'm Thimal Jayasooriya, and I'm a third year PhD student supervised by Dr. Suresh Manandhar.

My research area is mainly to do with information retrieval, text mining and natural language processing. This is related to the work done by some other members of the Artificial Intelligence Group.

I am currently a research student funded by the Ubiquitous Digital Agents (UDA) project. The UDA project is part of Amadeus, a collaboration between numerous academic and industrial partners funded by the DTI NTWM initiative. The UDA is primarily tasked with providing natural language agent interfaces for various appliances and devices. Previous work done in this project included the SmartFridge experiments, a graphical agent interface to a refrigerator.

I am presently involved in several areas of research related to the UDA project. One area is the optimized storage and retrieval of information within the context of a resource constrained environment. Another is the expansion and efficient processing of queries, also operating in an environment with limitations on processing power and available memory (for example, a set top box for the recording and playback of digital media).

Some details about my literature review talk discussing work done last year (January, 2004) can be found here

Previously, I also worked on identifying semantic classes for various words found within documents. This involved indexing content within a large document set into dimensions. Some work analysing the capability of these dimensions in returning relevant search results has been published and is available here.

I have also worked on crawling and indexing content in languages other than English, specifically South Asian languages. The main problems in this area appear to be lack of lexical resources and nonstandard representation mechanisms. UniLiteral, a conversion engine from legacy fonts to Unicode attempts to provide a smooth transition between commercially available representation schemes and their Unicode equivalent.

Room 203O,
Department of Computer Science,
The University of York,
Heslington,
York YO10 5DD
Email

Telephone +44-(0)1904-432757
Fax +44-(0)1904-432767