he-yufeng/README.md

Hey, I'm Yufeng He 👋

Agentic AI Researcher @ Moonshot AI (Kimi) | MS CS @ HKU | Champion, Shanghai Global AI Contest | 3x ACM-ICPC Silver Medalist | Former Intern @ Baidu, Maimai, Kuaishou

Projects

| Project | Stars | What it does |
| --- | --- | --- |
| CoreCoder | 528+ | 512K lines of Claude Code → 1,400 lines of Python. Every key architectural pattern, runnable. Any LLM. 7 architecture deep-dive articles. |
| FindJobs-Agent | 158+ | LLM-powered job toolkit: skill gap analysis, mock interviews, resume optimization |
| MTSIR3-GAN | 134+ | Adapting R3GAN (NeurIPS 2024) to multivariate time series imputation + adversarial refinement study |
| ContractGuard | 99+ | AI agent that reviews contracts for red flags before you sign |
| DRL-MultiFactorTrading | 74+ | Deep RL trading strategies with Double DQN and multi-factor alpha signals |
| Arxiv Paper Coding Agent | 37+ | Multi-LLM crew that reads a paper and produces a working implementation |
| AgentProbe | 34+ | Pytest plugin for regression-testing AI agents: snapshot baselines, semantic comparison, mock LLMs |
| PromptDiff | 25+ | Semantic diff for LLM prompts: compare prompt versions like git diff |
| RuleForge | 24+ | Auto-generate AI assistant rules (CLAUDE.md, .cursorrules) from codebase analysis |
| GitSense | 23+ | AI-powered open-source contribution finder: tell it your skills and it finds issues |
| TokenTracker | 18+ | Drop-in LLM cost tracker: change one import line and see where your money goes |
| AnyCoder | 15+ | AI coding agent CLI supporting 100+ LLMs via litellm. ~1,450 lines of Python, dangerous command blocking, parallel tool execution, session persistence. |
| BatchLLM | 15+ | Batch LLM processing: CSV in, processed CSV out, with retries and cost tracking |
| RepoWiki | new | Open-source DeepWiki alternative: generate wiki docs for any codebase from CLI or browser. PageRank file ranking, Mermaid diagrams, reading guides. |
| CodeABC | 2+ | AI code reader for non-programmers: drag in a project, get plain-language explanations with hover annotations |

Open Source Contributions

| Project | PR | What I Fixed |
| --- | --- | --- |
| Mooncake (5k★) | #1629 | GB200 MNNVL EP hang: cudaMalloc → cuMemCreate(FABRIC) + cuMemMap for cross-node NVLink |
| Mooncake (5k★) | #1728 | Hard pin for eviction-protected objects: model weights never get evicted, const field + BatchEvict skip + backward-compat serialization |
| Mooncake (5k★) | #1644 | MNNVL warmup hang: skip redundant handshake for fabric-connected nodes |
| Mooncake (5k★) | #1831 | TENT NVLink IPC fix: use base pointer for sub-allocated GPU tensors, porting #1622 fix to TENT path |
| Mooncake (5k★) | #1825 | Fix P2PClientService::Put silently swallowing write errors: propagate actual error codes for non-idempotent failures |
| Mooncake (5k★) | #1626 | Silenced error log spam for non-memory replicas in metadata store |
| vLLM (74k★) | #37699 | Fix weight offloading ignoring VLLM_WEIGHT_OFFLOADING_DISABLE_PIN_MEMORY in prefetch offloader |
| vLLM (74k★) | #37884 | RoBERTa position_ids in-place accumulation on CUDA graph padding: BGE-M3 crash after ~4000 requests |
| vLLM (74k★) | #37301 | Base64 JPEG video frames returning empty metadata: populate frame count, fps, duration |
| Transformers (158k★) | #44710 | Fix AutoProcessor.from_pretrained silently dropping hub kwargs (revision, token, etc.) |
| FlashInfer (5k★) | #2756 | Fix autotuner crash when input tensor is None: proper None-checking for optional inputs (fixes #2749) |
| FlashInfer (5k★) | #2772 | Fix compilation error: add missing `<optional>` header for std::optional usage in CUTLASS headers |
| vLLM (74k★) | #37727 | Fix Responses API instructions leaking across turns via previous_response_id chain |
| AstrBot (28k★) | #6596 | Multimodal token counting: images, audio, chain-of-thought were invisible to context compression |
| AstrBot (28k★) | #6474 | SQLite database is locked under concurrent writes: added busy timeout |
| AstrBot (28k★) | #7003 | SSE heartbeat for WebChat: long context compression killed the connection |
| AstrBot (28k★) | #6581 | Context truncation dropping the only user message: causes 400 from Zhipu/Gemini |
| AstrBot (28k★) | #6656 | /stop follow-up race: agent_stop flag not checked during follow-up capture |
| AstrBot (28k★) | #6710 | Skills-like re-query dropping image captions: extra_user_content_parts not forwarded |
| AstrBot (28k★) | #6527 | Fix LLM tool selection: rewrote ambiguous Upload/Download descriptions |
| AstrBot (28k★) | #6313 | Null choices guard: OpenAI API returning None instead of empty list |
| AstrBot (28k★) | #7217 | Fix qwen3-rerank response parsing: handle both old and new Dashscope API formats |
| AstrBot (28k★) | #7216 | Fix Gemini tool call 400: wrap plain-text tool results as Protobuf Struct |
| AstrBot (28k★) | #7196 | Fix Gemini thinking parts leaking into user-facing response |
| AstrBot (28k★) | #7398 | Fix Telegram sendMessageDraft spamming 400 errors on empty text in streaming mode |
| AstrBot (28k★) | #7407 | Fix Gemini native search 400 when no function tools: skip FunctionCallingConfig |
| AstrBot (28k★) | #6551 | Fix empty content causing Grok 400: set content to None when empty |
| ClawTeam (4.1k★) | #1 | First PR: 122 tests, CI, team templates, config bugfixes, task duration tracking |
| ClawTeam (4.1k★) | #40 | Pluggable TaskStore: extract task persistence into swappable backend abstraction |
| ClawTeam (4.1k★) | #32 | Gemini CLI support: spawn, permissions, prompt injection for both backends |
| ClawTeam (4.1k★) | #36 | Kimi CLI support: spawn backend, permission handling, 3 new test cases |
| ClawTeam (4.1k★) | #24 | Team snapshot/restore: checkpoint and recovery for long-running agent swarms |
| LightRAG (31k★) | #2796 | Fix None file_path propagating as unknown_source: fill gaps left by #2793 |
| OpenClaw (343k★) | #41271 | Log auth profile resolution failures instead of swallowing silently |
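
One of the AstrBot rows above mentions adding a busy timeout to get past SQLite's `database is locked` error under concurrent writes. As a rough illustration of that general technique (not the code from the PR itself), here is a minimal sketch using Python's standard `sqlite3` module. The helper name `connect_with_busy_timeout` and the 5000 ms default are assumptions made for this example.

```python
import sqlite3


def connect_with_busy_timeout(path: str, timeout_ms: int = 5000) -> sqlite3.Connection:
    """Open a SQLite connection that waits for a lock instead of failing at once.

    Hypothetical helper for illustration only; the name and default are assumptions.
    """
    # The `timeout` argument (in seconds) tells sqlite3 how long to wait on a locked database.
    conn = sqlite3.connect(path, timeout=timeout_ms / 1000)
    # Set the same limit through the PRAGMA so it is explicit and can be read back.
    # PRAGMA values cannot be bound as parameters, so the value is cast to int first.
    conn.execute(f"PRAGMA busy_timeout = {int(timeout_ms)}")
    return conn


if __name__ == "__main__":
    conn = connect_with_busy_timeout(":memory:", timeout_ms=3000)
    # Read the setting back to confirm it was applied to this connection.
    print(conn.execute("PRAGMA busy_timeout").fetchone()[0])
    conn.close()
```

With a busy timeout set, a writer that finds the database locked retries for up to the configured time before raising, which is often enough when write contention is short-lived.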

Hi, I'm Yufeng He (何宇峰) 👋

AI Agent Researcher @ Moonshot AI (Kimi) | MS CS @ HKU | Champion, Shanghai Global AI Contest | 3x ACM-ICPC Silver Medalist | Former AI R&D Intern @ Baidu, Maimai, Kuaishou

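Projects

| Project | Stars | Summary |
| --- | --- | --- |
| CoreCoder | 528+ | Claude Code's 510K lines of source → a 1,400-line Python core rewrite. Works with any LLM; includes 7 architecture walkthrough articles |
| FindJobs-Agent | 158+ | LLM job-hunting toolkit: skill gap analysis, mock interviews, resume optimization |
| MTSIR3-GAN | 134+ | Adapting R3GAN (NeurIPS 2024) to multivariate time series imputation + an empirical study of adversarial refinement |
| ContractGuard | 99+ | AI contract-review agent that finds unfair clauses before you sign |
| DRL-MultiFactorTrading | 74+ | Deep reinforcement learning for quantitative trading: Double DQN + multi-factor alpha |
| Arxiv Paper Coding Agent | 37+ | Multi-LLM collaborative agent that reads a paper and automatically generates runnable code |
| AgentProbe | 34+ | Pytest plugin for regression-testing AI agents: snapshot baselines, semantic comparison, mock LLMs |
| PromptDiff | 25+ | Semantic diff tool for LLM prompts: compare prompt versions like git diff |
| RuleForge | 24+ | Auto-generates AI assistant rules (CLAUDE.md, .cursorrules) from a codebase |
| GitSense | 23+ | AI-driven open-source contribution finder: enter your skills and it finds issues automatically |
| TokenTracker | 18+ | LLM cost tracking: change one import line and it works |
| AnyCoder | 15+ | Terminal AI coding agent supporting 100+ LLMs via litellm. ~1,450 lines of Python, dangerous command blocking, parallel execution, session persistence |
| BatchLLM | 15+ | Batch LLM processing: CSV in, CSV out, with automatic retries + cost tracking |
| RepoWiki | new | Open-source DeepWiki alternative: generate wiki docs for a repository from the CLI or browser. PageRank file ranking, Mermaid architecture diagrams, reading guides |
| CodeABC | 2+ | AI code reader for non-programmers: drag in a project and get plain-language, line-by-line annotations |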

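Open Source Contributions

| Project | PR | What I Fixed |
| --- | --- | --- |
| Mooncake (5k★) | #1629 | GB200 MNNVL EP hang: cudaMalloc → cuMemCreate(FABRIC) + cuMemMap for cross-node NVLink communication |
| Mooncake (5k★) | #1728 | Hard pin eviction protection: model weights are never evicted, const field + BatchEvict skip + backward-compatible serialization |
| Mooncake (5k★) | #1644 | MNNVL warmup hang: skip the redundant handshake for fabric-connected nodes |
| Mooncake (5k★) | #1831 | TENT NVLink IPC fix: use the base pointer for sub-allocated GPU tensors, porting the #1622 fix to the TENT path |
| Mooncake (5k★) | #1825 | Fix P2PClientService::Put silently swallowing write errors: propagate the actual error code to the caller |
| Mooncake (5k★) | #1626 | Fix error log spam for non-memory replicas |
| vLLM (74k★) | #37699 | Fix weight offloading ignoring the VLLM_WEIGHT_OFFLOADING_DISABLE_PIN_MEMORY environment variable |
| vLLM (74k★) | #37884 | Fix RoBERTa position_ids in-place accumulation overflow: BGE-M3 CUDA graph padding crash after ~4000 requests |
| vLLM (74k★) | #37301 | Fix base64 JPEG video frames returning empty metadata: populate frame count, fps, duration |
| Transformers (158k★) | #44710 | Fix AutoProcessor.from_pretrained silently dropping hub kwargs (revision, token, etc.) |
| FlashInfer (5k★) | #2756 | Fix autotuner crash when the input tensor is None (fixes #2749) |
| FlashInfer (5k★) | #2772 | Fix compilation error: a missing `<optional>` include in CUTLASS headers left std::optional undefined |
| vLLM (74k★) | #37727 | Fix Responses API instructions leaking into later turns via the previous_response_id chain |
| AstrBot (28k★) | #6596 | Multimodal token counting: images, audio, and chain-of-thought were invisible to context compression |
| AstrBot (28k★) | #6474 | Fix SQLite database is locked under concurrent writes: add a busy timeout |
| AstrBot (28k★) | #7003 | Fix WebChat disconnecting on long responses: SSE heartbeat keep-alive, no more timeouts during context compression |
| AstrBot (28k★) | #6581 | Fix the truncator dropping the only user message, which caused Zhipu/Gemini to return 400 |
| AstrBot (28k★) | #6656 | Fix race condition where new messages were still captured by follow-up after /stop |
| AstrBot (28k★) | #6710 | Fix skills-like re-query dropping image captions: extra_user_content_parts not forwarded |
| AstrBot (28k★) | #6527 | Fix LLM tool selection: rewrote ambiguous Upload/Download descriptions |
| AstrBot (28k★) | #6313 | Fix crash when the OpenAI API returns None for choices |
| AstrBot (28k★) | #7217 | Fix qwen3-rerank response parsing: support both old and new Bailian API formats |
| AstrBot (28k★) | #7216 | Fix Gemini tool call 400: wrap plain-text tool results as a Protobuf Struct |
| AstrBot (28k★) | #7196 | Fix Gemini thinking parts leaking into user-visible message content |
| AstrBot (28k★) | #7398 | Fix empty-text sendMessageDraft spamming 400 errors in Telegram streaming mode |
| AstrBot (28k★) | #7407 | Fix Gemini native search 400 when there are no function tools: skip FunctionCallingConfig |
| AstrBot (28k★) | #6551 | Fix empty content causing Grok 400: set content to None when empty |
| ClawTeam (4.1k★) | #1 | First PR: 122 tests, CI, team templates, config fixes, task duration tracking |
| ClawTeam (4.1k★) | #40 | Pluggable TaskStore: extract task persistence into a swappable backend abstraction layer |
| ClawTeam (4.1k★) | #32 | Gemini CLI support: spawn, permissions, prompt injection adapted for both backends |
| ClawTeam (4.1k★) | #36 | Kimi CLI support: spawn backend, permission handling, 3 new tests |
| ClawTeam (4.1k★) | #24 | Team snapshot/restore: checkpoint and recovery mechanism for long-running agent swarms |
| LightRAG (31k★) | #2796 | Fix None file_path propagating as unknown_source: covers the handling layers missed by #2793 |
| OpenClaw (343k★) | #41271 | Log auth profile resolution failures instead of silently swallowing them |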

Pinned

  1. FindJobs-Agent (Public)

    LLM-powered toolkit for skill analysis, AI interviews, resume scoring, and job structuring. Automates professional skill taxonomy and interview processes with adaptive difficulty.

    Python · 181 stars · 19 forks

  2. CoreCoder (Public)

    Minimal AI coding agent (~950 LoC Python) inspired by Claude Code. Works with any LLM. Think NanoGPT for coding agents. Formerly NanoCoder.

    Python · 551 stars · 149 forks

  3. ContractGuard (Public)

    AI agent that reads the fine print so you don't have to. Upload any contract → get red flags, unfair terms, and plain-English explanations in seconds.

    Python · 99 stars · 12 forks