CodeZen

A GitHub project discovery site focused on the Chinese-speaking community

infinity

Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali

bert-embeddings llm text-embeddings
star 2.5k
Python

hallucination-leaderboard

Leaderboard Comparing LLM Performance at Producing Hallucinations when Summarizing Short Documents

generative-ai hallucinations llm
star 2.8k
Python

lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuning gpt llama llm llm-inference
star 3.5k
Python

GenerativeAIExamples

Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.

gpu-acceleration large-language-models llm llm-inference microservice
star 3.6k
Jupyter Notebook

text-embeddings-inference

A blazing fast inference solution for text embeddings models

ai embeddings huggingface llm ml
star 4.2k
Rust

baml

The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)

baml boundaryml guardrails llm llm-playground
star 6.7k
Rust

plandex

Open source AI coding agent. Designed for large projects and real world tasks.

ai ai-agents ai-developer-tools ai-tools cli
star 14.6k
Go

letta

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

ai ai-agents llm llm-agent
star 19.1k
Python

open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

ai llm llm-ui llm-webui llms
star 114.6k
JavaScript

aici

AICI: Prompts as (Wasm) Programs

ai inference language-model llm llm-framework
star 2.1k
Rust

FinGLM

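FinGLM: dedicated to building an open, non-profit, long-lasting financial large-model project that uses open source and openness to advance "AI + finance".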

chatglm finacial gpt llama llm
star 2.1k
HTML

langserve

LangServe 🦜️🏓

deployment fastapi langchain langchain-python llm
star 2.2k
JavaScript