explaingit

meta-llama/llama

Analysis updated 2026-05-18

59,389 stars | Python | Audience · developer | Complexity · 3/5 | Setup · hard

TLDR

Deprecated repository that originally provided Meta's Llama 2 language model weights and inference code. Now redirects to newer maintained repositories.

Mindmap

mindmap
  root((repo))
    What it was
      Llama 2 inference
      Model weights
      PyTorch code
    Model sizes
      7 billion params
      13 billion params
      70 billion params
    Hardware needs
      Single GPU small
      Multiple GPUs large
      CUDA required
    Current status
      Deprecated
      No longer maintained
      Redirects elsewhere
    Where to go now
      llama-models repo
      PurpleLlama safety
      llama-cookbook examples

What do people build with it?

USE CASE 1

Run Llama 2 language model locally on your own hardware without API costs.

USE CASE 2

Fine-tune or adapt Llama 2 for custom tasks using the provided inference framework.

USE CASE 3

Research and experiment with open-weights large language models at different scales.

What is it built with?

Python, PyTorch, CUDA, torchrun

How does it compare?

                   meta-llama/llama   666ghj/mirofish   docling-project/docling
Stars              59,389             59,373            59,251
Language           Python             Python            Python
Setup difficulty   hard               hard              moderate
Complexity         3/5                4/5               3/5
Audience           developer          pm founder        developer

Figures from each repo's GitHub metadata at analysis time.

How do you get it running?

Difficulty · hard | Time to first run · 1 day+

The repository is deprecated and redirects elsewhere. Obtaining Llama 2 weights requires access approval from Meta, and running them requires significant CUDA/PyTorch setup.
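Before attempting the setup, it can help to confirm that the command-line tools it relies on are installed. The following is a minimal sketch, not part of the repository: `check_tools` and `missing_tools` are hypothetical helper names, and the tool list is an assumption (`torchrun` is installed with PyTorch, `nvidia-smi` with the NVIDIA driver).

```python
import shutil

# Assumed tool list: torchrun ships with PyTorch and nvidia-smi ships with
# the NVIDIA driver. Adjust the names for your own environment.
REQUIRED_TOOLS = ("python3", "torchrun", "nvidia-smi")


def check_tools(tools=REQUIRED_TOOLS, which=shutil.which):
    """Map each tool name to its resolved path, or None if it is not on PATH."""
    return {name: which(name) for name in tools}


def missing_tools(report):
    """Return the sorted names of tools that were not found."""
    return sorted(name for name, path in report.items() if path is None)


if __name__ == "__main__":
    report = check_tools()
    for name, path in report.items():
        print(f"{name}: {path or 'NOT FOUND'}")
    if missing_tools(report):
        print("Missing:", ", ".join(missing_tools(report)))
```

Finding the tools on PATH only shows they are installed. It does not confirm that the GPU driver, CUDA version, and PyTorch build are compatible with one another.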

License could not be detected automatically. Check the repository's LICENSE file before use.

In plain English

This repository was the original home for Meta's Llama 2 language model inference code, but it is now deprecated. The README itself explains that Meta has consolidated its model repositories and this one is no longer being maintained.

The original purpose was to provide the model weights and minimal Python code needed to load and run Llama 2, which was Meta's open-weights large language model ranging from 7 billion to 70 billion parameters. A large language model, or LLM, is an AI system trained on vast amounts of text that can generate coherent, contextually appropriate responses to prompts and questions.

When this repository was active, you would download the model weights from Meta's website after accepting a license agreement, then use a command called torchrun to launch the model and send it text prompts to complete or answer. The inference code used PyTorch as the deep learning framework and required CUDA-capable hardware for the larger model sizes. Different model sizes required different numbers of GPUs to run, with the smallest 7-billion-parameter version fitting on a single GPU and the 70-billion-parameter version requiring eight.

The project's primary usefulness was giving researchers and developers access to a capable open-weights model they could run locally and adapt without API costs. The README now directs users to newer, actively maintained repositories including llama-models, PurpleLlama for safety tooling, and llama-cookbook for practical usage examples. You would only encounter this repository when following older tutorials or tracing the history of the Llama model family.
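The relationship between model size and GPU count can be sketched as a small helper that assembles a torchrun command line. This is an illustration under stated assumptions, not the repository's own tooling:

- The 7B and 70B GPU counts come from the description above. The 13B value of two is not stated on this page and is an assumption.
- The script name `example_text_completion.py` and the flags `--ckpt_dir` and `--tokenizer_path` are assumptions about the repository's example scripts and should be checked against its README.
- The paths in the usage line are hypothetical placeholders.

`--nproc_per_node` is a standard torchrun option that sets how many processes to start on the machine.

```python
import shlex

# Number of GPUs (one process per GPU) for each model size.
# 7B -> 1 and 70B -> 8 are from the description above; 13B -> 2 is an assumption.
GPUS_PER_MODEL = {"7b": 1, "13b": 2, "70b": 8}


def build_torchrun_command(size, ckpt_dir, tokenizer_path,
                           script="example_text_completion.py"):
    """Return the argument list for launching inference with torchrun.

    The script name and the --ckpt_dir / --tokenizer_path flags are
    assumptions; verify them against the repository's README.
    """
    key = size.lower()
    if key not in GPUS_PER_MODEL:
        raise ValueError(f"unknown model size: {size!r}")
    return [
        "torchrun",
        "--nproc_per_node", str(GPUS_PER_MODEL[key]),
        script,
        "--ckpt_dir", ckpt_dir,
        "--tokenizer_path", tokenizer_path,
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = build_torchrun_command("7B", "llama-2-7b/", "tokenizer.model")
    print(shlex.join(cmd))
```

The command is built as a list rather than a single string so it can be passed to `subprocess.run` without shell quoting issues.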

Copy-paste prompts

Prompt 1
How do I set up the original Llama 2 inference code from this deprecated repository?
Prompt 2
What are the GPU requirements for running different sizes of Llama 2 models?
Prompt 3
Where should I go now that this Llama repository is no longer maintained?
Prompt 4
How do I load Llama 2 model weights and run inference with PyTorch and torchrun?

Frequently asked questions

What is llama?

Deprecated repository that originally provided Meta's Llama 2 language model weights and inference code. Now redirects to newer maintained repositories.

What language is llama written in?

Mainly Python. The stack also includes PyTorch, CUDA, and torchrun.

What license does llama use?

License could not be detected automatically. Check the repository's LICENSE file before use.

How hard is llama to set up?

Setup difficulty is rated hard, with roughly a day or more to a first successful run.

Who is llama for?

Mainly developers.

Verify against the repo before relying on details.