sebastianruder/nlp-progress

Analysis updated 2026-05-18

★ 22,972PythonAudience · researcherComplexity · 1/5LicenseSetup · easy

Mindmap

mindmap
  root((repo))
    What it does
      Tracks state of art
      Lists benchmarks
      Records best scores
      Covers many tasks
    NLP Tasks
      Translation
      Question answering
      Named entity recognition
      Summarization
      Sentiment analysis
    Languages covered
      English
      Chinese
      Spanish
      French
      Hindi
    Use cases
      Choose baseline model
      Understand task scope
      Find improvement gaps
      Compare approaches

mindmap root((repo)) What it does Tracks state of art Lists benchmarks Records best scores Covers many tasks NLP Tasks Translation Question answering Named entity recognition Summarization Sentiment analysis Languages covered English Chinese Spanish French Hindi Use cases Choose baseline model Understand task scope Find improvement gaps Compare approaches

Click or tap to explore — scroll the page freely

What do people build with it?

USE CASE 1

Find the best-performing model for a specific NLP task like machine translation or question-answering to use as a baseline.

USE CASE 2

Identify benchmark datasets and evaluation metrics for an NLP problem you're working on.

USE CASE 3

Discover how much room for improvement exists in a particular language processing task.

USE CASE 4

Compare different approaches and models to decide which direction to pursue for a new NLP project.

What is it built with?

Python

How does it compare?

	sebastianruder/nlp-progress	vonng/ddia	modelcontextprotocol/python-sdk
Stars	22,972	23,006	22,898
Language	Python	Python	Python
Setup difficulty	easy	easy	easy
Complexity	1/5	1/5	3/5
Audience	researcher	developer	developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · easy Time to first run · 5min

Use freely for any purpose including commercial, as long as you keep the copyright notice.

In plain English

This repository is a community-maintained reference tracking the best-known results in Natural Language Processing (NLP), the field of AI concerned with understanding and generating human language. NLP is a broad field covering many specific tasks: translating text between languages, answering questions, detecting who or what is mentioned in text, summarizing documents, recognizing speech, analyzing sentiment, and dozens more. For each task, the repository lists the standard benchmark datasets used to evaluate AI models, describes what the task involves, and records the best scores achieved by published research, this is called the "state of the art" (SOTA). It covers tasks for multiple languages including English, Chinese, Vietnamese, Hindi, French, Spanish, Korean, and others. You would use this if you are an AI researcher or engineer looking to understand what problems exist in NLP, which datasets are used to test solutions, and how well current methods perform. It serves as a starting point for choosing which approach or model to build on for a new NLP project, or to understand how much room for improvement remains in a given task. This is a reading and reference resource, not runnable software. Contributions from the community are welcome.

Copy-paste prompts

Prompt 1

Show me the state-of-the-art results for named entity recognition on English datasets from the nlp-progress repository.

Prompt 2

What are the standard benchmark datasets used to evaluate machine translation models according to nlp-progress?

Prompt 3

Find the best-performing models for sentiment analysis tasks and their scores from the nlp-progress reference.

Prompt 4

List the NLP tasks covered in nlp-progress and identify which ones have the largest gap between current performance and human performance.

Prompt 5

What languages does nlp-progress track, and which language has the most NLP tasks with published benchmarks?

Frequently asked questions

What is nlp-progress?

A community-maintained reference tracking the best-known results and benchmarks across Natural Language Processing tasks like translation, question-answering, and sentiment analysis.

What language is nlp-progress written in?

Mainly Python. The stack also includes Python.

What license does nlp-progress use?

Use freely for any purpose including commercial, as long as you keep the copyright notice.

How hard is nlp-progress to set up?

Setup difficulty is rated easy, with roughly 5min to a first successful run.

Who is nlp-progress for?

Mainly researcher.

Open on GitHub → Explain another repo

This repo across BitVibe Labs

Scan in gitsafehub Deploy in gitdeployhub sebastianruder on gitmyhub

Verify against the repo before relying on details.