hub

CodeZen

CodeZen

一个专注中文区的 GitHub 项目发现

avatar

LongWriter

LongWriter:释放长上下文LLM的10,000+字生成能力 🤗 HF 仓库 • 📃 论文 • 🚀 HF 空间 English./README.md | 中文./README_zh.md | 日本語./README_jp.md https://github.com/user-attachments/assets/c7eedeca-98ed-43ec-8619-25137987bcde 左:LongWriter-glm4-9b;右:GLM-4-9B-chat 🔥 更新 **2024年8月18日** 您现在可以使用vllmhttps://github.com/vllm-

fine-tuning llm long-context long-text
star1.8k
Python
avatar

CyberScraper-2077

A Powerful web scraper powered by LLM | OpenAI, Gemini & Ollama

ai-scraping gemini-api llm llm-scraper openai
star1.9k
Python
avatar

beeai-framework

Build production-ready AI agents in both Python and Typescript.

agents ai ai-agent beeai framework
star2.9k
Python
avatar

optillm

Optimizing inference proxy for LLMs

agent agentic-ai agentic-framework agentic-workflow agents
star3.1k
Python
avatar

LLMForEverybody

每个人都能看懂的大模型知识分享,LLMs春/秋招大模型面试前必看,让你和面试官侃侃而谈

agent interview-practice interview-questions llm rag
star4.7k
Jupyter Notebook
avatar

nexa-sdk

Run the latest LLMs and VLMs across GPU, NPU, and CPU with PC (Python/C++) & mobile (Android & iOS) support, running quickly with OpenAI gpt-oss, Granite4, Qwen3VL, Gemma 3n and more.

gemma3 go gpt-oss granite4 llama
star5.6k
Go
avatar

humanlayer

The best way to get AI coding agents to solve hard problems in complex codebases.

agents ai amp claude-code codex
star6.7k
TypeScript
avatar

NarratoAI

利用AI大模型,一键解说并剪辑视频; Using AI models to automatically provide commentary and edit videos with a single click.

aiagent aiops gemini-api llm moviepy
star7.1k
Python
avatar

note-gen

NoteGen 一款跨平台的 Markdown AI 笔记软件 致力于使用 AI 建立记录和写作的桥梁

chatbot knowledge-base llm markdown mcp
star9.9k
TypeScript
avatar

mastra

The TypeScript AI agent framework. ⚡ Assistants, RAG, observability. Supports any LLM: GPT-4, Claude, Gemini, Llama.

agents ai chatbots evals javascript
star18.1k
TypeScript
avatar

harbor

Effortlessly run LLM backends, APIs, frontends, and services with one command.

ai bash cli container docker
star2.1k
Python
avatar

ramalama

RamaLama is an open-source developer tool that simplifies the local serving of AI models from any source and facilitates their use for inference in production, all through the familiar language of containers.

ai containers cuda hacktoberfest hip
star2.3k
Python