Export your Telegram history, fine-tune a local model on it, and deploy a bot that replies to new messages in your exact phrasing and style.
Create a private digital twin trained entirely on your own hardware so your conversations never leave your machine.
Build a Slack or Discord bot that impersonates your communication style for fun, archival, or personal-assistant purposes.
Requires a CUDA 12.6 GPU with enough VRAM for the chosen model size, larger models give noticeably better output quality.
WeClone is an end-to-end toolkit for turning your own chat history into a chatbot that talks the way you do. You feed it a dump of your past messages from a platform like Telegram, the tool cleans the data and fine-tunes a large language model on it, and then you can plug the resulting model back into a chat service as a bot that mimics your phrasing, vocabulary, and replying style, what the project calls your digital avatar. The pipeline covers every step: exporting chat data, preprocessing it (including stripping out personal information like phone numbers, email addresses, credit cards, IP addresses, locations, and bank or wallet addresses using Microsoft Presidio plus a user-defined blocklist), fine-tuning a model, and deploying the result. By default it uses the Qwen2.5-VL-7B-Instruct multimodal model and the LoRA technique for supervised fine-tuning, but you can swap in any other model supported by LLaMA Factory, and the README provides a table of VRAM requirements from 7B up to 70B. After training, the bot can be deployed to Telegram, Discord, Slack, or personal WeChat accounts, WhatsApp support is under construction. You would reach for WeClone if you want a private, locally trained digital twin of yourself for fun, archival, or assistant purposes, and you would rather keep your conversations on your own hardware than send them through a cloud service. The project is written in Python, currently supports Telegram as the main data source, uses uv as its environment manager and expects CUDA 12.6 or newer for GPU training. The README warns that larger models trained on more data give noticeably better results. The full README is longer than what was provided.
← xming521 on gitmyhub — every repo by this author, as a profile.
Verify against the repo before relying on details.