ymcui/chinese-llama-alpaca-2

★ 7,142PythonAudience · researcherComplexity · 4/5Setup · hard

Mindmap

mindmap
  root((repo))
    What it does
      Chinese AI models
      Base and chat versions
      Extended context
    Model sizes
      1.3B parameter
      13B parameter
      RLHF aligned
    Tech Stack
      Python
      transformers
      llama.cpp
    Use Cases
      Chinese chat apps
      Research fine-tuning
      Local CPU inference

mindmap root((repo)) What it does Chinese AI models Base and chat versions Extended context Model sizes 1.3B parameter 13B parameter RLHF aligned Tech Stack Python transformers llama.cpp Use Cases Chinese chat apps Research fine-tuning Local CPU inference

Click or tap to explore — scroll the page freely

Things people build with this

USE CASE 1

Run a Chinese-language chat assistant locally on your computer using quantized models

USE CASE 2

Fine-tune the base model on your own Chinese dataset for a custom NLP task

USE CASE 3

Build a Chinese-language question-answering or writing assistant into your app

USE CASE 4

Test extended-context reading of long Chinese documents up to 64,000 tokens

Tech stack

PythonPyTorchtransformersllama.cppvLLM

Getting it running

Difficulty · hard Time to first run · 1h+

Requires a GPU with sufficient VRAM, smaller models can be quantized to run on CPU but will be slower.

In plain English

This repository provides Chinese-language versions of two AI language models: LLaMA-2 and Alpaca-2. LLaMA-2 is a foundational text model released by Meta, Alpaca-2 is a version of that model further trained to follow instructions and hold conversations, similar to how a chat assistant works. The models here have been adapted to work much better with Chinese text by expanding the vocabulary they understand and training them on large amounts of Chinese data. There are two main categories of models: base models, which are good at continuing text given a prompt, and chat or instruction models, which are better at answering questions, writing, and back-and-forth conversation. Several sizes are available, from smaller 1.3 billion parameter models to larger 13 billion parameter ones. There are also extended-context versions that can read and generate much longer passages of text, with some supporting up to 64,000 tokens of context at once, which is roughly equivalent to a short novel. The models can be run on a personal computer using techniques that compress them to use less memory. The repository includes scripts for pre-training and fine-tuning, so researchers and developers can train their own variants. It is compatible with a range of popular tools in the AI community, such as transformers, llama.cpp, and vLLM. Some models in this project have also been trained with a technique called RLHF, which uses human feedback to make the model's responses more aligned with human values and preferences. The README is written primarily in Chinese and the project is aimed at Chinese-language AI research and application development. The full README is longer than what was shown.

Copy-paste prompts

Prompt 1

Using the ymcui/chinese-llama-alpaca-2 13B chat model, help me draft a formal business email in Chinese requesting a meeting with a client.

Prompt 2

I have a Chinese customer support dataset. Walk me through fine-tuning the chinese-alpaca-2 base model on it using the included training scripts.

Prompt 3

Using the chinese-llama-alpaca-2 model with llama.cpp, show me how to run inference on a CPU with 8GB RAM using quantization.

Prompt 4

I want to summarize a long Chinese legal document (around 30,000 characters). Which chinese-llama-alpaca-2 model variant should I use and how do I load it?

Prompt 5

Compare the base model vs the chat model from chinese-llama-alpaca-2, when should I use each one for my Chinese text generation project?

Open on GitHub → Explain another repo

← ymcui on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.