Machine learning in non-stationary environments

Explore techniques for developing models that can perform well on data that significantly differs from the training data.

Start date: October 2016
Category: Fundamental Research
Contact point: Okba Bekhelifi -

Problem description

Develop novel approaches to building machine learning systems capable of reliable inference on test data that follows a different distribution from the training data. Apply these approaches to brain-computer interfaces and robotics control.

Current approaches include covariate shift adaptation [1] and stationary subspace analysis [2].

Why this problem matters

The universal assumption underlying machine learning is that the phenomenon being modeled is stationary and that the training data at hand is representative of it, i.e. that the data on which the model will be tested follows the same distribution as the training data. This doesn’t hold true for several important practical applications, in particular brain-computer interfaces (where there are large discrepancies in brain signal patterns across different individuals) and robotics control (where robot control policy may be trained in simulated environments or controlled environments different from the deployment environment). This leads to a repetitive requirement of additional data collection and calibration before any serious use of such systems.

Solving this issue would help brain-computer interfaces to become mainstream. Additionally, it would allow for training robotics systems on simulation data while retaining real-world capabilities, which would prove immensely valuable for the entire field of robotics.