egocs-400k/dataset

Analysis updated 2026-05-18

★ 45PythonAudience · researcherComplexity · 4/5Setup · moderate

Mindmap

mindmap
  root((EgoCS-400K))
    What it is
      CS2 gameplay video
      10000 plus hours
      40000 plus rounds
    Annotation Layers
      Per-tick game state
      Keyboard and mouse inputs
      Atomic action labels
      Natural language captions
    Data Structure
      Player round trajectories
      Coherent video segments
      Protected action chains
    Access
      Hugging Face dataset
      Interactive viewer
      arXiv technical report
    Audience
      AI researchers
      World model training

mindmap root((EgoCS-400K)) What it is CS2 gameplay video 10000 plus hours 40000 plus rounds Annotation Layers Per-tick game state Keyboard and mouse inputs Atomic action labels Natural language captions Data Structure Player round trajectories Coherent video segments Protected action chains Access Hugging Face dataset Interactive viewer arXiv technical report Audience AI researchers World model training

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Train an AI model to recognize in-game actions like firing or reloading from synchronized first-person video and game state.

USE CASE 2

Build a video-language alignment model using EgoCS-400K's paired gameplay clips and natural-language captions.

USE CASE 3

Research player behavior modeling by analyzing per-tick keyboard, mouse, and game-state traces from competitive CS2 matches.

What is it built with?

PythonHugging FacePyTorch

How does it compare?

	egocs-400k/dataset	lorenliu13/claude-code-for-hydrology	arccalc/dwmfix
Stars	45	44	43
Language	Python	Python	Python
Setup difficulty	moderate	easy	easy
Complexity	4/5	2/5	2/5
Audience	researcher	researcher	general

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · moderate Time to first run · 1h+

Requires a Hugging Face account to download, annotation processing code is not yet publicly released.

The README does not specify a license for this dataset.

In plain English

EgoCS-400K is a large research dataset built for training AI models that understand how games are played. It focuses on Counter-Strike 2, the competitive first-person shooter, capturing over 10,000 hours of first-person gameplay video across more than 40,000 match rounds on 13 maps. What makes this dataset unusual is the depth of synchronized data attached to each video clip. Every frame is paired with the exact keyboard and mouse inputs the player pressed at that moment, the game state at tick-level granularity (32 ticks per second, capturing position, weapon, health, and round context), labels for individual actions like firing or reloading, groupings of actions that belong together, and natural-language captions describing what is happening. This combination lets AI researchers train models that connect what a player sees to what they do. The data is organized into a hierarchy of scales: full-round player trajectories, model-ready video clips with stable temporal boundaries, protected action chains (groups of actions that should not be split during training), individual atomic actions, and raw per-tick state traces. A dynamic programming algorithm creates the clip boundaries to avoid cutting sequences mid-action. The dataset is publicly available on Hugging Face, and an interactive viewer on Hugging Face Spaces lets you browse videos alongside their synchronized annotations. A technical report on arXiv covers the construction pipeline in detail. Processing and annotation code are planned for release as the project continues to expand. This is a niche resource aimed at AI researchers working on video understanding, action recognition, or world models for interactive environments. It is not a playable game, a bot, or a tool for casual use.

Copy-paste prompts

Prompt 1

I want to fine-tune a video understanding model on EgoCS-400K. How do I download it from Hugging Face and write a data loader for per-tick action labels?

Prompt 2

Show me how to use EgoCS-400K's annotation format to train a model that predicts the next player action from first-person video and game state.

Prompt 3

How does the EgoCS-400K protected action chain concept work, and how do I use it to avoid splitting meaningful sequences during clip sampling?

Prompt 4

I want to evaluate a world model on EgoCS-400K's segment data. What metrics should I use and how are DP segment boundaries stored in the JSON files?

Prompt 5

Help me visualize per-tick telemetry from an EgoCS-400K sample alongside the video, matching keyboard inputs to specific video frames.

Frequently asked questions

What is dataset?

EgoCS-400K is a research dataset of 10,000+ hours of Counter-Strike first-person gameplay video, paired with per-tick game state, keyboard inputs, action labels, and captions for training AI world models.

What language is dataset written in?

Mainly Python. The stack also includes Python, Hugging Face, PyTorch.

What license does dataset use?

The README does not specify a license for this dataset.

How hard is dataset to set up?

Setup difficulty is rated moderate, with roughly 1h+ to a first successful run.

Who is dataset for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub egocs-400k on gitmyhub

Verify against the repo before relying on details.