IMPASTO: Integrating Model-Based Planning with Learned Dynamics Models for Robotic Oil Painting Reproduction

Yingke Wang1, Hao Li1, Yifeng Zhu2, Koven Yu1, Ken Goldberg3,
Li Fei-Fei1, Jiajun Wu1, Yunzhu Li4, Ruohan Zhang1


1 Stanford University
2 The University of Texas at Austin
3 University of California, Berkeley
4 Columbia University
IMPASTO Pearl
IMPASTO Landscape

Abstract

Robotic reproduction of oil paintings using soft brushes and pigments requires force-sensitive control of deformable tools, prediction of brushstroke effects, and multi-step stroke planning, often without human step-by-step demonstrations or faithful simulators. Given only a target oil painting image, can a robot infer and execute the stroke trajectories, forces, and colors needed to reproduce it? We present IMPASTO, a robotic oil-painting system that integrates learned pixel dynamics models with model-based planning. The dynamics models predict canvas updates from image observations and parameterized stroke actions; a receding-horizon model predictive control optimizer then plans trajectories and forces, while a force-sensitive controller executes strokes on a 7-DoF robot arm. IMPASTO learns solely from robot self-play and achieves high-fidelity replication on human artists' single-stroke datasets and multi-stroke artworks, outperforming baselines in reproduction accuracy. By integrating low-level force control, learned dynamics models, and high-level closed-loop planning, IMPASTO is a step toward robots that can paint with the finesse of a human artist by manipulating real brushes and paints.

Overview of IMPASTO

IMPASTO method overview

IMPASTO is a robotic oil painting system that integrates learned neural dynamics models with model-based planning algorithms to accurately replicate human artists' brushstrokes and artworks.

Method

IMPASTO Landscape

To concisely and expressively represent the strokes, we define 10 parameters including starting positions, tilting angle, RGBA, length, bend, and most importantly, force, to form the stroke as either a straight line or a parabolic curve. We then control the arm in joint-impedance mode with force injection and run a force-control loop along the surface normal. At the beginning of each stroke, a normal-direction admittance accumulates a smoothed feedforward force.

IMPASTO method

Overview of the learning and planning framework. Top: IMPASTO-UNet's neural pixel dynamics model, which combines an image encoder and an action encoder to predict the effect of a stroke. The model is trained using a weighted l1 loss. Bottom: To find one or more consecutive strokes between a base image and a target image, an MPC-based planner optimizes stroke parameters with a weighted l1 image objective in a receding-horizon, closed loop.

Independent Stroke Replication

IMPASTO ind

Target brushstrokes from five human artists (two examples per artist from the overall 60 strokes) and the strokes reproduced by the robot using different methods. The numbers shown are the weighted l1 loss between the target and the painted strokes. Instances with the best performances are highlighted with bold borders. Overall, IMPASTO-UNet more accurately reproduced human brushstrokes.

Overlaid Stroke Replication

IMPASTO overlaid

Qualitative results showing the target strokes from overlaid strokes and the strokes painted by the robot using different methods. Instances with the best performances are highlighted with bold borders. The base images (canvas states) already have painted strokes. This requires the dynamics models to make accurate predictions given the noisy background. The numbers shown are the weighted l1 loss. Note that the loss is only calculated around the target stroke area. IMPASTO-UNet is more accurate in reproducing human brushstrokes given the noisy base images.

Closed-loop Multi-step Replication

IMPASTO planning

Qualitative results showing the target paintings and the painting produced by the robot using IMPASTO vs. FRIDA. The planning horizon was set to be five, and the framed images are the targets for MPC. Our method can more accurately reproduce oil paintings.

Open-loop Single-step Replication

IMPASTO demo2

IMPASTO was able to replicate a complex oil painting with single-step planning.

Failure cases

Failure mode: worn-out brush and overly dry or wet pigments introduce stochasticity during execution even though the prediction and planning are accurate.

[Task-Conditioned Results Title]

Evaluation Results A
Evaluation Results B

[Placeholder caption...]

Interactive Demo

Demo will be available after review process...

[Interactive Demo Placeholder - Will be available after review]

Generalization Properties for [Application]

Using our proposed method as a [type of representation], policies learned from [training data description] generalize to [generalization type 1], [generalization type 2], [generalization type 3], and [generalization type 4].

[Task 1 Name]: [Task 1 Description]

Training Scenario

[Generalization Type 1]

[Generalization Type 2]
([Example Description])

[Generalization Type 3]
([Example Variation])

[Generalization Type 4]
([Example Change])

[Task 2 Name]: [Task 2 Description]

Training Scenario

[Generalization Type 1]

[Generalization Type 2]

[Generalization Type 3]
([Example Variation])

[Task 3 Name]: [Task 3 Description]

Training Scenario

[Generalization Type 1]

[Generalization Type 2] ([Example])

[Generalization Type 3]
([Example Variation])

[Generalization Type 4]
([Example Change])

Real-World Experiments

We further demonstrate our method's effectiveness in real-world scenarios. Each experiment uses [training setup description].

[Task 1 Name]: [Task 1 Real-World Description]

[Task 2 Name]: [Task 2 Real-World Description]

[Task 3 Name]: [Task 3 Real-World Description]