ravenscroftj/turbopilot

★ 3,790C++Audience · developerComplexity · 3/5Setup · moderate

Mindmap

mindmap
  root((TurboPilot))
    What it does
      Local code completion
      No cloud needed
      Copilot-compatible API
    Models supported
      Codegen
      WizardCoder
      Starcoder
      StableCode
    How to run it
      Pre-built binary
      Docker container
      GPU via CUDA
    Audience
      Privacy-focused devs
      Self-hosters
      No-subscription users

mindmap root((TurboPilot)) What it does Local code completion No cloud needed Copilot-compatible API Models supported Codegen WizardCoder Starcoder StableCode How to run it Pre-built binary Docker container GPU via CUDA Audience Privacy-focused devs Self-hosters No-subscription users

Click or tap to explore — scroll the page freely

Things people build with this

USE CASE 1

Get AI code suggestions inside VS Code without sending your code to any external server or paying for a subscription.

USE CASE 2

Run a private, offline code completion tool on a laptop with at least 4 GB of RAM using just a CPU.

USE CASE 3

Use GPU acceleration via Docker to get faster code suggestions on a machine with an NVIDIA graphics card.

USE CASE 4

Swap in a local AI backend for the vscode-fauxpilot extension as a free, self-hosted Copilot alternative.

Tech stack

C++llama.cppDockerCUDAOpenAI-compatible APIHugging Face modelsVS Code extension

Getting it running

Difficulty · moderate Time to first run · 1h+

Requires downloading a model separately from Hugging Face (several GB). Use the pre-built binary for simplest setup, or Docker for GPU support. Project is archived, no new updates.

Open source project, check the repository for the specific license. Now archived and no longer maintained.

In plain English

TurboPilot was an open source project that let developers run an AI code completion tool entirely on their own computer, without sending code to any external server. It worked as a local alternative to GitHub Copilot, the popular AI coding assistant that runs in the cloud. The project is now archived and no longer maintained as of September 2023, with the author noting that more mature alternatives are available. While it was active, TurboPilot ran large language models trained on code directly on a regular CPU, needing as little as 4 gigabytes of RAM for smaller models. It supported several models, including Salesforce Codegen, WizardCoder, Starcoder, and StableCode. Models were downloaded separately from Hugging Face, a public repository for machine learning models, and then loaded by the TurboPilot server process. Users with more RAM or a capable GPU could run larger, more accurate models. The server started on a local port and presented an API compatible with the Copilot and OpenAI format. This meant it could work as a drop-in backend for VS Code extensions like vscode-fauxpilot, which would send code to the local server instead of to GitHub's servers. The result was code completion suggestions that appeared inline as you typed, all processed on your own machine. Installation came in two forms: a pre-built binary for direct execution, or a Docker container image. The Docker version also supported NVIDIA GPU acceleration via CUDA, which significantly increased the speed of generating suggestions. For GPU use, specific CUDA versions of the container image were provided. The project was written in C++ and was based on work from the llama.cpp and fauxpilot projects. It was aimed at developers who wanted private, self-hosted code completion without a paid subscription or cloud dependency.

Copy-paste prompts

Prompt 1

I want to set up TurboPilot locally to get AI code completions in VS Code without using GitHub Copilot. Walk me through downloading a model from Hugging Face and connecting it to the vscode-fauxpilot extension.

Prompt 2

How do I run TurboPilot using Docker on a machine with an NVIDIA GPU? Give me the exact steps including which container image tag to use for CUDA support.

Prompt 3

I have 8 GB of RAM. Which TurboPilot-compatible model from Hugging Face should I download for the best balance of speed and code quality, and how do I load it?

Prompt 4

TurboPilot is archived, what are the most mature self-hosted alternatives I can use today that work the same way (local LLM server with a VS Code plugin)?

Prompt 5

Explain how TurboPilot's OpenAI-compatible API works so I can connect my own editor or tool to a locally running code completion model.

Open on GitHub → Explain another repo

← ravenscroftj on gitmyhub — every repo by this author, as a profile.

Verify against the repo before relying on details.