Analysis updated 2026-07-03
Run a local AI chatbot inside a .NET application without sending any user data to a cloud service.
Add document question-answering to an ASP.NET app by indexing your own files and letting a local model answer questions about them using RAG.
Integrate a local language model into a Unity game, WPF desktop app, or Blazor web app using the provided example projects.
| scisharp/llamasharp | mattparkerdev/sharpide | oskardudycz/eventsourcing.netcore | |
|---|---|---|---|
| Stars | 3,677 | 3,681 | 3,672 |
| Language | C# | C# | C# |
| Setup difficulty | moderate | hard | moderate |
| Complexity | 3/5 | 3/5 | 4/5 |
| Audience | developer | developer | developer |
Figures from each repo's GitHub metadata at analysis time.
Requires downloading a GGUF model file separately from Hugging Face and choosing the right hardware backend package.
LLamaSharp is a C# library that lets .NET developers run AI language models directly on their own computer or server, without sending data to a cloud service. It is built on top of a lower-level tool called llama.cpp, which handles the actual computation, and LLamaSharp wraps it with a friendlier API for .NET applications. The name refers to LLaMA, a family of open-weight AI language models originally released by Meta, though the library works with other compatible models as well. Installation comes in two parts. First you install the main LLamaSharp package from NuGet, which is the standard .NET package manager. Then you install a backend package that matches your hardware: a CPU-only backend that works on Windows, Linux, and Mac, a CUDA 11 or CUDA 12 backend for Nvidia GPUs on Windows and Linux, a Vulkan backend for other GPUs, or the CPU backend on Mac, which also uses the Metal GPU acceleration built into Apple hardware. No C++ compilation is required. Models must be in a format called GGUF. If a model you want to use is in a different format, you can convert it, but many pre-converted GGUF files are available to download directly from Hugging Face. Once set up, the library lets you load a model and generate text responses, hold multi-turn conversations, and process both text and images with multimodal models. It integrates with several Microsoft tools: Semantic Kernel, which is a framework for building AI-assisted applications, and Kernel Memory, which adds the ability to index and search documents so the model can answer questions about your own content. That pattern, where a model retrieves relevant documents before answering, is called RAG. The project also works with other frameworks including LangChain for .NET and BotSharp. Example projects in the repository show integrations with ASP.NET web applications, WPF desktop apps, Blazor, and Unity. A community Discord server and a QQ group are available for questions and support.
A C# library that lets .NET developers run AI language models locally on their own machine without cloud services, wrapping llama.cpp with a friendly API for Windows, Linux, and Mac.
Mainly C#. The stack also includes C#, .NET, llama.cpp.
License details not mentioned in the explanation.
Setup difficulty is rated moderate, with roughly 30min to a first successful run.
Mainly developer.
This repo across BitVibe Labs
Verify against the repo before relying on details.