CodeZen

A GitHub project discovery site focused on the Chinese-speaking community

SageAttention

[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves a 2-5x speedup over FlashAttention without losing end-to-end metrics across language, image, and video models.

attention cuda efficient-attention inference-acceleration llm
2.6k stars
Cuda

text-extract-api

Document (PDF, Word, PPTX ...) extraction and parsing API using state-of-the-art OCR and Ollama-supported models. Anonymizes documents, removes PII, and converts any document or picture to structured JSON or Markdown.

anonymization api extract json llm
2.9k stars
Python

Sidekick

A native macOS app that allows users to chat with a local LLM that can respond with information from files, folders and websites on your Mac without installing any other software. Powered by llama.cpp.

agentic-ai agents ai ai-agents aichat
3.1k stars
Swift

Integuru

The first AI agent that builds permissionless integrations through reverse engineering platforms' internal APIs.

agent agents ai-agent ai-agents api
4.5k stars
Python

Prompt_Engineering

This repository offers a comprehensive collection of tutorials and implementations for Prompt Engineering techniques, ranging from fundamental concepts to advanced strategies. It serves as an essential resource for mastering the art of effectively communicating with and leveraging large language models in AI applications.

ai genai llm llms opeani
6.8k stars
Jupyter Notebook

Janus

Janus-Series: Unified Multimodal Understanding and Generation Models

any-to-any foundation-models llm multimodal unified-model
17.6k stars
Python

suna

Kortix: Open Source Platform to Build, Manage and Train AI Agents. The complete platform for creating autonomous AI agents that work for you. Kortix is a comprehensive open source platform that empowers you to build, manage, and train sophisticated AI agents for any use case. Create powerful...

ai ai-agents llm
18.5k stars
TypeScript

LightRAG

🚀 LightRAG: Simple and Fast Retrieval-Augmented Generation. News (2025.11.05): Add RAGAS-based Evaluation Framework and Langfus...

genai gpt gpt-4 graphrag knowledge-graph
22.5k stars
Python

browser-use

🌐 Make websites accessible for AI agents. Automate tasks online with ease.

ai-agents ai-tools browser-automation browser-use llm
72.3k stars
Python

trench

Trench — Open-Source Analytics Infrastructure. A single production-ready Docker image built on ClickHouse, Kafka, and Node.js for tracking events. Easily build product analytics dashboards, LLM RAGs, observability platforms, or any other analytics product.

analytics clickhouse clickhouse-database clickhouse-server dashboard
1.6k stars
TypeScript

amazon-q-developer-cli

✨ Agentic chat experience in your terminal. Build applications using natural language.

agent ai amazon-q cli linux
1.8k stars
Rust

ml-retreat

Machine Learning Journal for Intermediate to Advanced Topics.

data-science documentation large-language-models learning-resources llm
2.2k stars
Jupyter Notebook