Center For Perceptual Systems
Rubin Face or Vase Illusion
Home Prospective Graduate Students Research Austin Life  UT Home Page What's Happening?
About Benjamin Kuipers
Bootstrap Learning
Benjamin Kuipers Publications
Benjamin Kuipers Links

William James [1890] wrote, “The baby, assailed by eyes, ears, nose, skin, and entrails at once, feels it all as one great blooming, buzzing confusion.”

Future robots will face the same problem. They will include mobile robots and intelligent vehicles, self-monitoring spacecraft, whole-building environmental monitoring systems, and arrays of sensors spread over large regions of earth, sea, or space. They will be equipped with increasingly rich sensory systems. MEMS technology will provide complex distributed sensor arrays with irregular, variable structure more like biological systems than like engineered systems. Long-lived robots will need to adapt to deteriorating (or improving) sensor systems. Intelligent systems such as these must learn the properties of their sensors and effectors, and adapt to changes, through their own experiences in the environment.

We are working to develop a learning agent with a domain-independent set of mathematical tools that can start with an uninterpreted sensorimotor interface to its environment, and can learn a hierarchy of representations for its experience and its world. The agent will take an incremental “bootstrap learning” approach that combines multiple machine learning algorithms to converge on the desired state of knowledge. The success criterion is for the agent's model of its sensorimotor system and its environment to support reliable planning, allowing the agent to formulate and achieve high-level goals.

Our Spatial Semantic Hierarchy (SSH) [Kuipers, AIJ, 2000] provides the target for this learning process. We assume that the structure of the environment itself, as perceived by the robot's sensors, will define locally distinctive states that can be reached by hill-climbing control laws from anywhere within their local neighborhoods, and qualitatively uniform segments of the environment within which trajectory-following control laws can take the robot reliably from one distinctive state to the neighborhood of another. The robot travels in its environment by selecting control laws to move reliably from one distinctive state to another. These elements naturally abstract to a causal graph of states and actions, and to a topological graph of places and paths. Both graphs may be annotated with local measurements, and local metrical maps may be constructed for the neighborhoods of distinctive states and places. The multi-layer map structure in the SSH eliminates the problem of cumulative estimated position error. Hill-climbing control laws eliminate small amounts of accumulating uncertainty, making it possible for the causal and topological maps to represent position in terms of graph nodes and arcs.

Spatial Semantic Hierarchy


Figure 1 shows the multiple representations of the Spatial Semantic Hierarchy, and their dependencies.

Our approach exploits the continuity of interaction with the physical world, rather than assuming a discretizing abstraction from the beginning. We exploit the properties of dynamical systems such as control laws to define qualitative abstractions matched to the structure of the environment.

The problem for our learning agent can then be restated as the problem of learning the sensory features and motor commands necessary for hill-climbing and trajectory-following control laws, starting with uninterpreted sensors and effectors in an unknown environment.

In previous work [Pierce & Kuipers, AIJ, 1997], we solved a version of this problem for a simulated mobile robot with a variety of sensors including a ring of sonar range-sensors. The agent learned the structure of its sensory system, including the sonar ring; then it learned the effects of actions and identified a small set of primitive actions; finally it identified local state variables and defined homing, open-loop and closed-loop control laws. These control laws were sufficient to define distinctive states, and thus to bootstrap to the Spatial Semantic Hierarchy.

Intermediate Representations


Figure 2 shows the various intermediate representations developed during the bootstrap learning process.

Stages of Behavior


Figure 3 (a,b,c) are stages of behavior during bootstrap learning:

     (a) random wandering
     (b) open-loop control laws
     (c) closed-loop control laws

Bootstrap learning requires choreographing several machine learning algorithms. For example, to learn the structure of the sensor ring, we applied a similarity measure based on correlation to assess the similarities among the sensors, then used multi-dimensional scaling to project the sensors into a high-dimensional space with positions consistent with their similarities. Principal component analysis (PCA) made it possible to select two dimensions that captured most of the variance, so the sensor positions were projected into those two dimensions. This step is the critical ontological change that introduces spatial position for sonar sensors. With this shift, we can estimate spatial as well as temporal changes, and motion becomes a meaningful concept. With a characterization of motion, it becomes possible to collect motion data from actions and apply PCA again to identify a small set of primitive actions.

Our goal in our current work is to extend this approach, adding new learning methods, scaling up to physical robots, with real sensors including vision, operating in real environments.