Robot Foundation Model Data Platform

Every signal needed to train robot foundation models.

Capture, validate, annotate, and export multimodal training data for VLAs, diffusion policies, humanoids, and world models — from real robot trajectories to simulation, egocentric human video, retargeted motion, force/tactile, and language annotations.

Export-ready for
GR00Topenpi (π0/π0.5)ACTDiffusion PolicyOpenVLAWorld ModelsHumanoid Whole-Body
30K+ indexed hours8 data classes<5ms cross-modal syncLeRobot · GR00T · HDF5 · MCAP
Bimanual ALOHA · 3-cam · 20 Hz
action[14]
aligned
state[15]
synced
QA
schema pass
Subtask timeline5 spans · auto-derived
reachgraspliftpourplace
Human ego + robot, one schema
150-D hand pose · 30 Hz · consent-cleared egocentric
See sample →
NVIDIA Isaac·FREQ: 100HZ·ROS 2·DEPTH: ACTIVE STEREO·PyTorch·FORMAT: HDF5·Hugging Face·SIM: ISAAC GYM·LeRobot·CAMERAS: 3+·Open X-Embodiment·LATENCY: <20MS·NVIDIA Isaac·FREQ: 100HZ·ROS 2·DEPTH: ACTIVE STEREO·PyTorch·FORMAT: HDF5·Hugging Face·SIM: ISAAC GYM·LeRobot·CAMERAS: 3+·Open X-Embodiment·LATENCY: <20MS·

Training Data Coverage

Heterogeneous signals. One supplier.

Modern robot policies — GR00T, π0/π0.5, OpenVLA, Helix — train on a mix of real teleop, autonomous rollouts, simulation, human video, motion capture, force/tactile, and language. We capture all of it.

Real teleop trajectories
ALOHA, humanoid, single-arm, mobile manipulator
BC, VLA fine-tuning, dexterous policies
Autonomous policy rollouts
Replay capture with success/failure tagging
Post-training, recovery, evaluation
Simulation trajectories
Isaac Lab / MimicGen-style synthetic data
Scale, randomization, sim-to-real
Human egocentric video
Activity clips with consent-cleared release
Semantic priors, world models
Human motion retargeting
Mocap → ALOHA / GR1-class retarget
Humanoid locomotion, whole-body priors
Force / tactile
Wrist F/T at 100 Hz + fingertip arrays
Contact-rich manipulation, dexterity
Language & semantic labels
Operator + hindsight VLM + reviewed spans
VLA instruction following
QA metadata
12-check pipeline, sync drift, calibration
Training reliability, episode weighting

Sample Lab

See the data before you license it.

Four real episodes — bimanual, humanoid, contact-rich dual-arm, and language-conditioned single-arm. Synchronized cameras, action-vs-state curves, joint-space trajectories, and dataset health, end-to-end.

Every inspector loads real captured state, action, timestamps, and video — not a mockup. License credit lives inside each bundle's metadata.

The Platform

Data, pipeline, and services. Unified.

Data Corpus

The Corpus

Real teleop, autonomous rollouts, simulation, human video, retargeted motion, force/tactile and language. Captured across ALOHA bimanual, humanoid, and collaborative robots — every frame production-ready.

8 data classesLeRobot · GR00T · HDF5 · MCAP12-check QA
Pipeline

Processing Pipeline

Raw video in, action tokens out. Our auto-labeling pipeline transforms teleoperation recordings into training-ready datasets.

01
Raw Video
02
Segmentation
03
Action Labels
04
Action Tokens
Services

Teleop Services

Human-in-the-loop dexterity. Our expert operators capture complex manipulation scenarios with sub-20ms teleoperation latency and haptic feedback.

Latency
<20ms
Haptic
Enabled
Operators
Expert
Arms
Dual

Core Technology

End-to-end collection platform.

A crowdsourced pipeline connecting researchers with expert operators and calibrated hardware. Define your task, and we handle the rest.

01

Create Task

Researchers define collection requirements, scenarios, and quality criteria.

02

Match Contractor

System assigns qualified operators based on expertise and equipment.

03

Connect Hardware

Contractor connects to calibrated robot arms, cameras, and sensors.

04

Collect Data

Expert teleoperation capture with real-time monitoring and feedback.

05

Auto-Verify

Automated QA checks for trajectory continuity, drift, and format compliance.

06

Review & Deliver

Task creator reviews quality, provides feedback, data delivered in HDF5/LeRobot.

sample_task.json
{
  "task_type": "bimanual_manipulation",
  "scenarios": ["kitchen_pickup", "table_sort"],
  "episodes_required": 500,
  "quality_criteria": {
    "success_rate": 0.95,
    "max_drift_ms": 5
  },
  "output_format": "hdf5+lerobot"
}
Launch Platformplatform.signiq-lab.ai

Training-Ready Guarantees

Validated, synced, training-ready.

Every episode — teleop, sim, human video, retargeted motion — passes the same automated QA before delivery. What ships is what your policy sees: synchronized, labeled, and formatted for immediate training.

<5ms
Sync Drift

Cross-modal synchronization verified per-frame. RGB, depth, force/torque, and proprioception aligned within 5ms tolerance.

100%
Episode Labels

Every episode tagged success or failure with failure-mode annotation. Train on clean demonstrations, test against edge cases.

8
Data Classes

Real teleop, autonomous rollouts, simulation, human video, retargeted motion, force/tactile, language, QA — captured under one schema.

5 min
To First Train

Native LeRobot v2/v2.1, GR00T-LeRobot, HDF5, and MCAP. Load directly into your training loop — RLDS and robomimic exports available on request.

Automated QA Pipeline

Every episode passes through 12 automated checks before entering the dataset. Rejected episodes are re-collected, not patched.

Trajectory continuity validated — no teleportation artifacts
Joint position limits enforced per-robot URDF spec
Camera intrinsics and extrinsics calibrated per-session
Gripper state binary-labeled at action boundaries
Duplicate and corrupt frames automatically rejected
End-effector 6D pose (XYZ + RPY) computed and verified

Marketplace

Ready-to-train packages.

View all datasets
Featured
aloha

ALOHA Configuration - Home Tasks

700 hours of dual-arm teleoperation data covering bedroom, kitchen, and living room scenarios. Includes synchronized multi-camera RGB observations, high-frequency joint position data, and end-effector pose tracking.

700h
Hours
4.2k
Episodes
3
Cameras
LeRobotHDF5MCAP
Featured
humanoid dual arm

Humanoid Dual-Arm Dataset

600 hours of humanoid dual-arm manipulation across diverse household and retail scenarios. Features depth sensing, force feedback, and 4-camera coverage for complex object interactions.

600h
Hours
3.6k
Episodes
4
Cameras
LeRobotHDF5MCAP
Featured
collaborative dual arm

Collaborative Dual-Arm Collection

900 hours of collaborative dual-arm robot operations for complex manipulation tasks. Optimized for warehouse logistics, bin picking, and sorting operations with depth perception.

900h
Hours
5.4k
Episodes
3
Cameras
LeRobotHDF5MCAP
collaborative single arm

Collaborative Single-Arm Tasks

300 hours of single-arm collaborative robot data for precision assembly and manipulation. Features force feedback for contact-rich tasks and fine motor control.

300h
Hours
1.8k
Episodes
2
Cameras
LeRobotHDF5MCAP

Strategic partners

Deploying with the world's leading teams.

Collaborating with industry leaders to define the future of embodied AI.

UBTECH
Walker S

Walker S Series

Industrial-grade dexterity powered by SignIQ Lab's manipulation dataset. Deployed in major EV manufacturing lines.

X-HUMANOID
Tiangong

Tiangong (天工)

The world's first full-sized electric running humanoid. Training on SignIQ Lab's embodied intelligence platform.

Train on the data, not the data sheet.

Inspect a real episode end-to-end, then talk to our team about ready-to-use datasets or a custom capture run in our instrumented facilities.