A frequent problem in data exchange is the need to map between alternative formats for data. The AutoMed platform provides a repository for BAV transformations that can be used to specify the logical mapping between these alternative formats of data. The aim of this project is to write a compiler to read BAV transformations from the AutoMed repository, and to produce applications (in Java, C#, and/or SQL) that are capable of processing large volumes of data (meaning data sets containing thousands of rows of data, as opposed to small demonstration files).
The objective of this project will be to develop a set of transformation patterns that can be used to solve the general problem of data exchange between databases with conflicting representations of data, to implement those patterns as tools that create BAV transformations in the AutoMed API, and to then compile those transformations into code that can be used in routine data processing (eg Java with JDBC, or SQL with views on remote tables). The project should also illustrate the application of those transformation patterns and compilation tools in a case study.
There will be a PhD award available for students in the general area of Data Integration from October 2011, in which work on this project could be continued to PhD level.