Stars
AirLLM: 70B inference with a single 4GB GPU
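Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.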
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
Stable Diffusion web UI
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation
Visual Studio Code Extension for DevChat
The fully compliant, embeddable high-performance Go MQTT v5 server for IoT, smarthome, and pubsub
An Open Source Machine Learning Framework for Everyone
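VSCode extension: automatically generates and updates file header comments in VSCode, and automatically generates function comments with support for extracting function parameters. Supports all mainstream languages, is well documented, simple to use, flexible to configure, and has been maintained for many years.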
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge manageme…
Large Language Model Text Generation Inference
A high-throughput and memory-efficient inference and serving engine for LLMs
Maid is a cross-platform Flutter app for interfacing with GGUF / llama.cpp models locally, and with Ollama and OpenAI models remotely.
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
fyabc / vllm
Forked from vllm-project/vllm. A high-throughput and memory-efficient inference and serving engine for LLMs
WCDB is a cross-platform database framework developed by WeChat.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
This repo hosts the source for the DirectX Shader Compiler which is based on LLVM/Clang.
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
OpenAI-style API for open large language models: use LLMs just like ChatGPT! Supports LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3, etc.…
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal chat model approaching GPT-4o performance.