LLM on Kaiwen Che

LLM on Kaiwen Chehttps://kaiwenche.github.io/tags/llm/Recent content in LLM on Kaiwen CheHugoen-usTue, 07 Apr 2026 00:00:00 -0700Can Coding Agents Learn Editorial Taste with RLVR?https://kaiwenche.github.io/posts/rlvr-resume-agent/Tue, 07 Apr 2026 00:00:00 -0700https://kaiwenche.github.io/posts/rlvr-resume-agent/Training a Qwen3-14B resume revision agent with RLVR and an LLM judge. Tool mastery comes fast, content quality learns slow, and the rubric tensions reveal what reward design really means for creative domains.