Merge multiple separate robot demonstration recording sessions into one unified LeRobot dataset.
Audit a robot video dataset for frozen frames, bad dimensions, or near-black content before model training.
Record robot demonstrations with an Orbbec Femto Bolt depth camera and store RGB-D data alongside standard video.
Upload a cleaned robot training dataset to Hugging Face with support for resuming large transfers.
Requires Python 3.12, ffmpeg, an Orbbec Femto Bolt depth camera for RGB-D capture, and a configured LeRobot environment.
This toolkit helps researchers and roboticists build training datasets for robot learning models, specifically the kind that learn from video and robot movement data together. The project sits on top of LeRobot, an open-source framework from Hugging Face for collecting and storing robot demonstration data. A VLA model (Vision-Language-Action) is a type of AI that watches video, reads instructions, and decides how to move a robot arm. To train one, you need large amounts of carefully organized recordings. The toolkit handles the full pipeline from raw recordings to a published dataset. It supports two camera types: standard RGB color video and RGB-D, which adds a depth channel so the system can also record how far away objects are. The depth support is built around the Orbbec Femto Bolt camera and stores depth as lossless image files alongside the standard video. Both types work within the same workflow. The main steps the toolkit covers are merging, auditing, cleaning, and uploading. Merging takes multiple separate recording sessions (each stored as its own folder) and combines them into one unified dataset, rewriting all the internal indices and metadata so nothing conflicts. Auditing runs quality checks on the resulting dataset: it scans each video for empty files, wrong dimensions, frozen frames, nearly black or white content, and similar problems, then sorts episodes into keep, review, and drop lists. Cleaning builds a fresh copy of the dataset that excludes the dropped episodes. Uploading pushes the final result to Hugging Face with support for resuming large transfers. There is also a capture overlay that slots into the official LeRobot repository so you can record Orbbec depth data directly through LeRobot's existing recording scripts. You copy the overlay files into your LeRobot checkout and install the extra dependencies, then use a config file to point it at your camera and robot ports. The project requires Python 3.12 or newer and depends on ffmpeg for video processing. A sample merged RGB-D dataset collected with this toolkit is already published on Hugging Face under the name lerobot_derek_depth.
← ly-derekx on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.