2 RESEARCH ASSOCIATES IN ARTIFICIAL INTELLIGENCE FOR DATA ANALYTICS
Opportunity for 2 Research Associates to work on advanced methods from artificial intelligence, including machine learning, deep learning, and semantic technologies, to help automate the data analytics process.
Data analytics, the process of transforming a raw dataset into useful knowledge, can be a painstaking and expensive process, but often the majority of effort lies not in building statistical models, or training machine learning algorithms, but in all the other tasks that go into in data preparation, exploration, and interpretation. The Artificial Intelligence for Data Analytics (AIDA) project at the Alan Turing Institute is an ambitious effort to develop an integrated artificial intelligence system that guides the user through every step of the process, magnifying the productivity of working data scientists.
In the first phase of the AIDA project, we will develop new methods for data ‘wrangling’, an often laborious and time-consuming process that accounts for up to 80% of a typical data science project. Data wrangling includes understanding what data is available, integrating data from multiple sources, identifying missing, messy or anomalous data, and extracting features in order to prepare data for computer modelling.
The AIDA research project will not only aim at new advances in artificial intelligence and machine learning to address data wrangling issues; we also aim to develop systems that help to automate each stage of the data analytics process. It is anticipated that the resulting technology will benefit researchers, industry and government, dramatically improve the productivity of working data scientists, and revolutionise the speed and efficiency with which data can be transformed into useful knowledge.
The AIDA team will consist of five investigators (Chris Williams, James Geddes, Zoubin Ghahramani, Ian Horrocks, Charles Sutton), three research assistants, software engineers, data scientists, and aligned PhD students.
We invite applications from talented and qualified researchers to become part of the AIDA research project. These posts will be based at the Alan Turing Institute hub in London.
THE ROLES
The AIDA project will build an intelligent data analytics system that guides an analyst through a semi-automated process of acquiring, preparing, integrating, transforming, cleaning, and understanding data for analysis. AIDA will aid data preparation by combining technologies from logical artificial intelligence and statistical machine learning. Logic-based AI contributes by providing a powerful set of tools for integrating and representing metadata about data whose complexity and heterogeneity makes it impossible to represent as a set of relational tables. Statistical machine learning provides a powerful set of techniques for inferring what is "typical" for a data source, which can underlie new techniques for identifying low-quality data, for evaluating the effect of data transformations, and for summarizing data by reporting on typical behaviour.
The two Research Associates (RAs) will take on different aspects of the work programme as described below. They will work as part of the AIDA team, and collaborate with the software engineers in order to create a unified system. These roles are initially on a three-year fixed term contract.
RA1 will work on Data Acquisition and Transformation. The goal will be to develop a data acquisition component (DAC) that uses semantic technology to support analytics tools in the acquisition and transformation of relevant data from large, distributed, dynamic, and heterogeneous data sources. The DAC will provide comprehensive data restructuring functionality, which will require the extension of semantic technologies with features such as aggregation. We will also investigate the use data analysis techniques to reveal hidden/lost data structure, and the use of axiomatic data structure to support data understanding, cleaning and analysis.
RA2 will work on Data Understanding, Data Quality, and Cleaning. The main tasks in data understanding will be around helping the data analyst to build their intuition about what is in the data, looking both for common patterns and anomalies. This will include work using techniques from statistical machine learning, deep learning and related areas to infer compact summaries of datasets, automatically inferring the types of variables from data, and producing simple reports describing statistically reliable patterns in data. For data quality and cleaning, important tasks include handling missing data, entity disambiguation, and identifying anomalies, and passing on possible uncertainty in these components to later stages of the analysis. We will explore ways of developing new data quality methods, exploiting particularly the methods of statistical machine learning, possibly in combination with semantic technologies.
DUTIES & RESPONSIBILITIES
The RAs will work with the AIDA team (including the investigators, software engineers, data scientists and PhD students) to advance the aims of the AIDA project.
The main responsibilities of the posts are:
• To conduct research on topics in artificial intelligence for data analytics;
• To collaborate in the preparation of reports, publications and presentations;
• To collaborate and/or present findings at academic and practitioner conferences and meetings.
PERSON SPECIFICATION
Essential:
· A PhD or equivalent qualification in a relevant discipline (at the point of taking up the position);
· A proven publication record in a relevant discipline (e.g. in peer-reviewed academic journals and other high-quality publication venues);
· Appropriate software skills for the implementation of research software;
· Excellent communication and interpersonal skills, for interaction with the AIDA research team and project partners, and the ability to present complex technical information.
Desirable:
· Experience in working in semantic technologies and/or statistical machine learning, or related fields.
THE ALAN TURING INSTITUTE
The Alan Turing Institute (the Turing) is the national centre for data science, established in 2015 with the mission to make great leaps in transformational data science research that will have positive real-world impacts.
The Institute has cross-disciplinarity at its core; we bring researchers in mathematics and theoretical computer science, statistics and machine learning, algorithm for data analytics and distributed computing, computational social science and data ethics, and industry partners, to work together in an open and collaborative environment with a shared goal to generate world-class research in data science.
Our researchers are motivated by driving impact, both through theoretical development and application to real-world problems. In our first year we have identified six priority sectors to focus our translational research: Data-Centric Engineering; Defence and Security; Smart Cities; Culture and Media; Financial Services; and Health and Wellbeing.
We have attracted strategic partnerships with a broad range of users of data science including the Lloyd’s Register Foundation, Intel, GCHQ and HSBC. We are looking to develop partnerships with government departments and have recently announced a collaboration with the Office of National Statistics.
We invite you to join us as we grow our research community, supporting our goal to develop the next generation of data science leaders, shape the public conversation, and push the boundaries of this new science for the public good.
APPLICATION PROCEDURE
If you are interested in this opportunity, please send your CV, and a covering letter to . Please also arrange for two references to be sent directly to this e-mail address by the closing date at the latest.. If you have questions or would like to discuss the role further with a member of the Institute’s HR Team, please contact them on 0203 862 3322 or email .
CLOSING DATE FOR APPLICATIONS: 23rd OCTOBER 2017
Please note all offers of employment are subject tocontinuous eligibility to work in the UKand satisfactorypre-employmentsecurity screening which includes a DBS Check.
Full details on the pre-employment screening process can be requested from .