explaingit

younesbensafia/arxiv-reader-mcp

14PythonAudience · researcherComplexity · 2/5LicenseSetup · easy

TLDR

A local server that lets AI assistants like Claude search, browse, and read full papers from arXiv, the giant open-access library of science and computer science research, without leaving your editor or chat window.

Mindmap

mindmap
  root((repo))
    What it does
      Search arXiv papers
      Fetch paper metadata
      Read full PDF text
      Browse by category
    Tech stack
      Python
      MCP protocol
      uv runner
      stdio transport
    Use cases
      Research summaries
      Paper discovery
      AI literature review
      Category monitoring
    Audience
      Researchers
      AI tool users
      Developers
      Students
Click or tap to explore — scroll the page freely

Code map

Detail Auto

An interactive map of this repo's files and how they connect — its source is parsed live in your browser. Click Visualize to build it.

filefunction / class

Things people build with this

USE CASE 1

Find the latest research papers on a topic like 'diffusion models' or 'quantum computing' directly inside Claude or your AI coding assistant.

USE CASE 2

Ask your AI to summarize a specific arXiv paper by its ID, including methodology and results, without copying and pasting anything.

USE CASE 3

Monitor a research category like cs.LG (machine learning) for recent papers and get a quick digest of what's new.

USE CASE 4

Have an AI read and explain the full text of a paper's PDF, not just the abstract, to help you decide if it's worth reading in depth.

Tech stack

PythonMCPuvarXiv APIPDF extraction

Getting it running

Difficulty · easy Time to first run · 30min

Requires Python and the uv tool. Add a short JSON config block to your MCP client settings pointing at the server's run command. Works with Claude and any MCP-compatible client.

MIT license, free to use, modify, and share for any purpose, including commercial projects. No restrictions.

In plain English

arxiv-reader-mcp is a server that connects AI coding assistants and chat tools to arXiv, the large open-access repository of scientific papers used widely in physics, mathematics, computer science, and related fields. The server uses the Model Context Protocol (MCP), which is a standard that lets AI tools call external services in a structured way. Once installed, an AI assistant can search for papers, pull metadata, and even read the full text of a PDF, all without leaving the editor or conversation window. The server exposes five tools. One searches by keyword, author, category, or date range. A second does a quick keyword search with optional filters. A third fetches the title, authors, abstract, and PDF link for a specific paper ID. A fourth returns recent papers in a chosen subject category. The fifth downloads and extracts the full text from a paper's PDF, which lets an AI assistant read and summarize the actual content rather than just the abstract. arXiv organizes papers by category codes. The README lists common ones: cs.AI for artificial intelligence, cs.LG for machine learning, cs.CV for computer vision, cs.CL for computation and language, and many others across physics, statistics, and biology. Users can filter any search to a specific category. Setting it up requires Python and a tool called uv for running Python projects. The server runs as a local process that communicates with a compatible client over standard input and output. To connect it to Claude on claude.ai, you add a short JSON block to the MCP settings file pointing at the server's run command. The README also mentions an MCP Inspector tool for testing the server interactively before connecting a full client. The project is open source under the MIT license and includes example prompts showing what you can ask an AI once the server is connected: finding recent papers on a topic, summarizing a specific paper by its arXiv ID, or reading the methodology section of a paper in full.

Copy-paste prompts

Prompt 1
Search arXiv for the 5 most recent papers on retrieval-augmented generation (cs.CL category) and summarize what each one is about in plain English.
Prompt 2
Fetch the metadata for arXiv paper 2305.10601, give me the title, authors, abstract, and a link to the PDF.
Prompt 3
Use the arxiv-reader-mcp tool to read the full text of arXiv paper 2406.12345 and explain the methodology section in simple terms.
Prompt 4
Find papers on computer vision (cs.CV) published in the last 30 days that mention 'object detection' and list the top 5 by relevance.
Prompt 5
Read the full PDF of this arXiv paper and pull out the key findings, limitations, and any datasets or benchmarks they used.
Open on GitHub → Explain another repo

← younesbensafia on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.