PhD position in Paris Saclay Univ
Title:
Iterative and Expressive Querying for Big Data Series
Research Areas:
Data Management Systems/Databases, Interactive Visualization, Human-Computer Interaction
Summary:
Domains such as astronomy, and genome sequencing, are currently collecting a staggering amount of data, a significant percentage of which is in the form of data series. To make sense of it, scientists need to interactively explore them, by formulating hypotheses and progressively refining them. Our goal is to add iterative and expressive query mechanisms to big data-series collections. We propose novel interaction and visualization techniques for data series exploration and analysis, and focus on their scalability to multi-terabyte data-series collections. To this end, the thesis will develop interactive tools that allow analysts to express vagueness in their queries and then refine them in an iterative manner. The thesis will also introduce techniques for visualizing high-cardinality query results. Existing data-series indexing algorithms will need to be revised to accommodate the above requirements.
Requirements:
Masters (or equivalent) degree in databases/data management systems or related field, fluency in written and spoken English, solid programming skills. Any background in visualization and human-computer interaction or UI development experience is a plus. The candidate should be enthusiastic about research. Knowledge of French is not required.
Coordinators:
Themis Palpanas, Paris-Descartes University
http://www.mi.parisdescartes.fr/~themisp
Anastasia Bezerianos, University of Paris-Sud, Inria / Paris-Saclay
https://www.lri.fr/~anab/
Starting Date and Funding:
The thesis is expected to start in October 2016. The duration of a Ph.D. thesis in France is 3 years (maximum 4 years). Full funding is provided, including expenses for international conferences and other major research events.
Application Process:
Contact the thesis coordinators (see above) by April 20, 2016 by including an updated CV and a letter of interest.
Work Environment:
The work will be conducted between the Paris Descartes University, located in the heart of Paris, and Inria located at the campus of Paris-Saclay, one of the largest research and business clusters in Europe.
The diNo team (Paris Descartes University) directed by Themis Palpanas has world-class expertise on problems related to data series management, indexing, and analysis. It has developed the current state of the art data series indexes, experimentally demonstrating scalability to dataset sizes in excess of 1 billion data series, which is 2-3 orders of magnitude more than previous approaches. The team is applying these techniques to real-world problems, and has ongoing collaborations in this area with neuroscientists (Paris Descartes University), astrophysicists (Paris Diderot University), and Facebook.
The HCC team (Paris-Saclay) is part of the DigiScope Equipex project on the use of connected, large display platforms for collaborative analysis of large amounts of information. Anastasia Bezerianos has several international research collaborations related to large data visualization and interaction that have led to publications in the top HCI and visualization venues. These include works on visual perception and graphical chart understanding, data exploration, uncertainty visualization, and temporal visualization and understanding. Theophanis Tsandilas (https://www.lri.fr/~fanis/) has a long experience on sketch-based interfaces, expressive graphical representations, and creativity-support tools.