📚 Weekly AI Paper Digest

Period: 2026-04-06 ~ 2026-04-11 | Selection: the five papers that drew the most attention this week (Top 5)


🏆 This Week's Top 5

| Rank | Paper | ⬆️ | Deep Dive |
|------|-------|----|-----------|
| 🥇 | GrandCode: Achieving Grandmaster Level i… | 347 | DD-061 |
| 🥈 | Adam’s Law: Textual Frequency Law on Lar… | 309 | DD-062 |
| 🥉 | Video-MME-v2: Towards the Next Stage in … | 224 | DD-063 |
| 4. | InCoder-32B-Thinking: Industrial Code Wo… | 224 | DD-064 |
| 5. | SkillClaw: Let Skills Evolve Collectivel… | 215 | DD-065 |

🔍 This Week's Trends

Core Keywords

  • Agentic evolution and reinforcement learning: the rise of agent systems that go beyond plain language models, learning on their own and evolving their skills
  • High-difficulty code and industrial reasoning: attempts at grandmaster-level competitive programming, plus reasoning that understands the complex hardware constraints of real industrial settings such as chip design
  • Redefining benchmarking and addressing saturation: calling out score inflation in existing benchmarks and proposing stricter evaluation criteria that assess a model's actual robustness and reliability
  • Textual Frequency Law (TFL): a theoretical finding that systematizes how textual frequency affects LLM performance, in a way analogous to human cognitive processes

Common Themes

This week's papers go beyond simple text generation and focus on **‘dynamic systems for complex problem solving’** and **‘more refined evaluation’**. To overcome the limits of static models on hard tasks such as coding and video understanding, they introduce reinforcement learning (RL), agent evolution, and specialized reasoning chains. At the same time, they share an effort to secure benchmark reliability and to pin down a basic operating principle of models (the frequency law).

Points to Note

As GrandCode and SkillClaw in particular show, the concept of ‘Autonomous Evolution’ is taking concrete shape: AI accumulates knowledge learned from user feedback or from interaction with its environment and improves its own performance. Video-MME-v2 and Adam’s Law, meanwhile, suggest that research is moving to dispel the illusions of current AI evaluation practice and toward measuring how robust and trustworthy models are in the real world.

Practical Implications

Developers and researchers should move beyond simply scaling up model size and turn their attention to designing **agent architectures in which the model can learn from its own experience (e.g., RL, ECoT)**. For coding and multimodal tasks, verifying the **reasoning process and robustness** in genuinely complex situations has become more important than existing benchmark scores. In addition, understanding the frequency characteristics of text data can offer new insight into efficient model training and optimization.


📑 Paper-by-Paper Summaries

🥇 1. GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

arXiv: 2604.02721 | ⬆️ 347 | → View Deep Dive | Tags: agentic-rl competitive-programming llm multi-agent-system reinforcement-learning grandcode code-generation grpo

This paper matters because it introduces GrandCode, an AI system that moves beyond the limits of a single model and, through multi-agent reinforcement learning (Agentic Reinforcement Learning), is the first to beat top-level human grandmasters in a live contest setting.

📖 Detailed analysis: see → View Deep Dive for the in-depth analysis.


🥈 2. Adam’s Law: Textual Frequency Law on Large Language Models

arXiv: 2604.02176 | ⬆️ 309 | → View Deep Dive | Tags: llm textual-frequency adam-law prompting fine-tuning nlp paraphrasing

This paper proposes Adam’s Law: to improve the performance of large language models (LLMs), textual frequency matters in addition to the quantity and quality of data, and rewriting an input into more commonly used expressions is effective.
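The idea of preferring more common wording can be illustrated with a small sketch. This is not the paper's method: the frequency table `TOY_FREQ`, the scoring rule (mean log word frequency), and the helper names `frequency_score` and `pick_most_frequent` are all hypothetical, chosen only to show what "pick the higher-frequency paraphrase" could look like in practice.

```python
import math
import re

# Hypothetical toy frequency table (occurrences per million words).
# The numbers are made up for illustration; a real setup would use
# counts from an actual corpus.
TOY_FREQ = {
    "use": 900.0, "utilize": 15.0, "the": 50000.0, "tool": 120.0,
    "to": 26000.0, "fix": 80.0, "rectify": 2.0, "bug": 40.0, "defect": 8.0,
}


def frequency_score(text, freq, floor=0.1):
    """Mean log-frequency of the words in `text`; unseen words get `floor`."""
    words = re.findall(r"[a-z']+", text.lower())
    if not words:
        return float("-inf")
    return sum(math.log(freq.get(w, floor)) for w in words) / len(words)


def pick_most_frequent(candidates, freq=TOY_FREQ):
    """Return the candidate paraphrase with the highest frequency score."""
    return max(candidates, key=lambda c: frequency_score(c, freq))


candidates = [
    "Utilize the tool to rectify the defect",
    "Use the tool to fix the bug",
]
print(pick_most_frequent(candidates))  # prints "Use the tool to fix the bug"
```

Both candidates share the words "the", "tool", and "to", so the choice here is decided entirely by the rarer words ("utilize", "rectify", "defect") scoring lower than their common counterparts. Whether such a rewrite actually helps a given model is an empirical question that this sketch does not answer.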

📖 Detailed analysis: see → View Deep Dive for the in-depth analysis.


🥉 3. Video-MME-v2: Towards the Next Stage in Benchmarks for Comprehensive Video Understanding

arXiv: 2604.05015 | ⬆️ 224 | → View Deep Dive | Tags: video-mme-v2 benchmark video-understanding mllm evaluation data-contamination temporal-reasoning

This paper matters because, to address the score inflation caused by data contamination and oversimplicity in existing benchmarks, it proposes Video-MME-v2, a new standard that introduces up-to-date data and a difficulty hierarchy to evaluate the genuine reasoning ability of video multimodal large language models.

📖 Detailed analysis: see → View Deep Dive for the in-depth analysis.


4. InCoder-32B-Thinking: Industrial Code World Model for Thinking

arXiv: 2604.03144 | ⬆️ 224 | → View Deep Dive | Tags: industrial-code world-model chain-of-thought reasoning gpu-optimization verilog error-correction llm

This paper is significant because it targets industrial code generation, where a model must understand hardware constraints and the execution environment beyond general coding ability. It overcomes the limits of existing models by combining reasoning ability with a world model that has learned the dynamics of the environment in which the code runs.

📖 Detailed analysis: see → View Deep Dive for the in-depth analysis.


5. SkillClaw: Let Skills Evolve Collectively with Agentic Evolver

arXiv: 2604.08377 | ⬆️ 215 | → View Deep Dive | Tags: llm-agent skill-evolution self-improving-system openclaw tool-learning collective-intelligence automated-debugging prompt-engineering

This paper matters because it overcomes the limits of today's static large language model agent (Large Language Model Agent) systems by presenting a self-evolving agent framework in which skills are automatically improved and shared through users' interactions.

📖 Detailed analysis: see → View Deep Dive for the in-depth analysis.


📅 Generated: 2026-04-12 | 🤖 GLM-4.7 Weekly Digest