๐Ÿ“š Weekly AI Paper Digest

๊ธฐ๊ฐ„: 2026-05-11 ~ 2026-05-16 ์„ ์ •: ์ด๋ฒˆ ์ฃผ ๊ฐ€์žฅ ์ฃผ๋ชฉ๋ฐ›์€ ๋…ผ๋ฌธ Top 5


๐Ÿ† ์ด๋ฒˆ ์ฃผ Top 5

์ˆœ์œ„๋…ผ๋ฌธโฌ†๏ธDeep Dive
๐Ÿฅ‡MinT: Managed Infrastructure for Traininโ€ฆ205DD-087
๐ŸฅˆMean Mode Screaming: Meanโ€”Variance Spliโ€ฆ182DD-088
๐Ÿฅ‰SenseNova-U1: Unifying Multimodal Undersโ€ฆ169DD-089
4.MemPrivacy: Privacy-Preserving Personaliโ€ฆ140DD-090
5.Achieving Gold-Medal-Level Olympiad Reasโ€ฆ137DD-091

๐Ÿ” ์ด๋ฒˆ ์ฃผ ํŠธ๋ Œ๋“œ

ํ•ต์‹ฌ ํ‚ค์›Œ๋“œ

  • ์ดˆ๊ฑฐ๋Œ€ ์ถ”๋ก  ๋Šฅ๋ ฅ (Hyper-scale Reasoning): ์ˆ˜ํ•™ ๋ฐ ๊ณผํ•™ ์˜ฌ๋ฆผํ”ผ์•„๋“œ ์ˆ˜์ค€์˜ ๋ณต์žกํ•œ ๋ฌธ์ œ ํ•ด๊ฒฐ ๋Šฅ๋ ฅ์„ ๋‹จ์ˆœํ•˜๊ณ  ํ†ตํ•ฉ๋œ ์Šค์ผ€์ผ๋ง ๋ฐฉ์‹์œผ๋กœ ๋‹ฌ์„ฑํ•˜๋Š” ์—ฐ๊ตฌ.
  • ์ธํ”„๋ผ ํšจ์œจํ™” (Infrastructure Efficiency): ํ•˜๋‚˜์˜ ๊ฑฐ๋Œ€ ๋ฒ ์ด์Šค ๋ชจ๋ธ์„ ํ†ตํ•ด ์ˆ˜๋ฐฑ๋งŒ ๊ฐœ์˜ ๊ฐœ์ธํ™”๋œ LoRA ์–ด๋Œ‘ํ„ฐ๋ฅผ ํšจ์œจ์ ์œผ๋กœ ํ•™์Šต ๋ฐ ์„œ๋น™ํ•˜๋Š” ๊ด€๋ฆฌํ˜• ์ธํ”„๋ผ ๊ธฐ์ˆ .
  • ์•„ํ‚คํ…์ฒ˜ ํ†ตํ•ฉ ๋ฐ ์‹ฌํ™” (Unification & Deepening): ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋ชจ๋ธ์—์„œ ์ดํ•ด์™€ ์ƒ์„ฑ์„ ํ•˜๋‚˜์˜ ๊ตฌ์กฐ๋กœ ํ†ตํ•ฉํ•˜๊ฑฐ๋‚˜, Diffusion Transformer๋ฅผ 1000์ธต ์ด์ƒ์œผ๋กœ ๊นŠ๊ฒŒ ์Œ“๋Š” ๊ตฌ์กฐ์  ์ง„ํ™”.
  • ํ”„๋ผ์ด๋ฒ„์‹œ ๋ณด์กด ๊ฐœ์ธํ™” (Privacy-Preserving Personalization): ์—ฃ์ง€-ํด๋ผ์šฐ๋“œ ํ™˜๊ฒฝ์—์„œ ์—์ด์ „ํŠธ์˜ ๊ฐœ์ธํ™”๋œ ๋ฉ”๋ชจ๋ฆฌ๋ฅผ ํ™œ์šฉํ•˜๋ฉด์„œ ๋ฏผ๊ฐ ์ •๋ณด๋ฅผ ๋ณดํ˜ธํ•˜๋Š” ๊ธฐ์ˆ .

๊ณตํ†ต ์ฃผ์ œ

์ด๋ฒˆ ์ฃผ ์—ฐ๊ตฌ๋“ค์€ ๋Œ€๊ทœ๋ชจ AI ๋ชจ๋ธ์˜ **์„ฑ๋Šฅ ํ•œ๊ณ„ ๊ทน๋‹น(์„ฑ๋Šฅ ์‹ฌํ™”์™€ ํ†ตํ•ฉ)**๊ณผ **์‹ค์ œ ํ™˜๊ฒฝ์—์„œ์˜ ์šด์˜ ํšจ์œจ์„ฑ(์ธํ”„๋ผ ๋ฐ ํ”„๋ผ์ด๋ฒ„์‹œ)**์„ ๋™์‹œ์— ํ•ด๊ฒฐํ•˜๋ ค๋Š” ๋…ธ๋ ฅ์ด ๋‘๋“œ๋Ÿฌ์กŒ์Šต๋‹ˆ๋‹ค. ๋‹จ์ˆœํžˆ ๋ชจ๋ธ์˜ ํฌ๊ธฐ๋ฅผ ํ‚ค์šฐ๋Š” ๊ฒƒ์„ ๋„˜์–ด, 1000๊ฐœ ์ธต ์ด์ƒ์˜ ๊นŠ์€ ๋„คํŠธ์›Œํฌ๋ฅผ ์•ˆ์ •์ ์œผ๋กœ ํ•™์Šต์‹œํ‚ค๊ฑฐ๋‚˜ ์ดํ•ด์™€ ์ƒ์„ฑ์„ ํ†ตํ•ฉํ•˜๋Š” ๋“ฑ ๊ตฌ์กฐ์ ์ธ ํ˜์‹ ๊ณผ ํ•จ๊ป˜, ์‹ค์ œ ์„œ๋น„์Šค ๋‹จ๊ณ„์—์„œ์˜ ๋น„์šฉ ํšจ์œจ์„ฑ๊ณผ ๋ณด์•ˆ์„ ์ค‘์‹œํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ํ๋ฅด๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

์ฃผ๋ชฉํ•  ์ 

๋…ผ๋ฌธ 2์—์„œ๋Š” 1000์ธต ์ด์ƒ์˜ Diffusion Transformer๋ฅผ ํ•™์Šตํ•  ๋•Œ ๋ฐœ์ƒํ•˜๋Š” โ€˜Mean Mode Screaming(MMS)โ€˜์ด๋ผ๋Š” ๋ถ•๊ดด ํ˜„์ƒ์„ ๊ทœ๋ช…ํ•˜๊ณ , ํ‰๊ท ๊ณผ ๋ถ„์‚ฐ์„ ๋ถ„๋ฆฌํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ด๋ฅผ ํ•ด๊ฒฐํ•˜์—ฌ ๋ชจ๋ธ ๊นŠ์ด ํ™•์žฅ์˜ ์ƒˆ๋กœ์šด ๊ฐ€๋Šฅ์„ฑ์„ ์ œ์‹œํ–ˆ์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ ๋…ผ๋ฌธ 3์€ ๊ธฐ์กด์— ๋ถ„๋ฆฌ๋˜์–ด ์žˆ๋˜ ์‹œ๊ฐ์  โ€˜์ดํ•ดโ€™์™€ โ€˜์ƒ์„ฑโ€™ task๋ฅผ ํ•˜๋‚˜์˜ ์•„ํ‚คํ…์ฒ˜(NEO-unify)๋กœ ํ†ตํ•ฉํ•˜์—ฌ, ํŒŒํŽธํ™”๋œ ๋ชจ๋ธ ๊ตฌ์กฐ์˜ ํ•œ๊ณ„๋ฅผ ๋„˜์–ด์„  ๋„ค์ดํ‹ฐ๋ธŒ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ๋Šฅ๋ ฅ์˜ ์ถœํ˜„์„ ์‹œ์‚ฌํ•œ๋‹ค๋Š” ์ ์ด ๋งค์šฐ ํฅ๋ฏธ๋กญ์Šต๋‹ˆ๋‹ค.

์‹ค๋ฌด ์‹œ์‚ฌ์ 

๊ฐœ๋ฐœ์ž ๋ฐ ์—”์ง€๋‹ˆ์–ด๋Š” ๋…ผ๋ฌธ 1(MinT)์—์„œ ์ œ์‹œํ•˜๋Š” ๊ฒƒ์ฒ˜๋Ÿผ, ๋ง‰๋Œ€ํ•œ ๋น„์šฉ์ด ๋“œ๋Š” ์™„์ „ ๋ชจ๋ธ ์ฒดํฌํฌ์ธํŠธ ์ƒ์„ฑ ์—†์ด ๋ฒ ์ด์Šค ๋ชจ๋ธ ์œ„์— LoRA ์–ด๋Œ‘ํ„ฐ๋งŒ์„ ๋™์ ์œผ๋กœ ๊ต์ฒดํ•˜์—ฌ ์„œ๋น„์Šคํ•˜๋Š” ์ธํ”„๋ผ ๊ตฌ์กฐ๋ฅผ ๋„์ž…ํ•จ์œผ๋กœ์จ ์šด์˜ ๋น„์šฉ์„ ํš๊ธฐ์ ์œผ๋กœ ์ ˆ๊ฐํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ๋˜ํ•œ ๋…ผ๋ฌธ 5์˜ โ€˜๋‹จ์ˆœํ•˜๊ณ  ํ†ตํ•ฉ๋œ ์Šค์ผ€์ผ๋ง ๋ ˆ์‹œํ”ผโ€™๋Š” ๋ณต์žกํ•œ ๋ฌธ์ œ ํ•ด๊ฒฐ์ด ํ•„์š”ํ•œ ๋„๋ฉ”์ธ(๊ธˆ์œต, ๊ณผํ•™, ๋ฒ•๋ฅ  ๋“ฑ)์—์„œ ๊ณ ์„ฑ๋Šฅ ์ถ”๋ก  ๋ชจ๋ธ์„ ๊ฐœ๋ฐœํ•˜๊ณ ์ž ํ•  ๋•Œ ์‹ค์งˆ์ ์ธ ๊ฐ€์ด๋“œ๋ผ์ธ์œผ๋กœ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.


๐Ÿ“‘ ๋…ผ๋ฌธ๋ณ„ ์š”์•ฝ

๐Ÿฅ‡ 1. MinT: Managed Infrastructure for Training and Serving Millions of LLMs

arXiv: 2605.13779 | โฌ†๏ธ 205 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: mint lora llm-infrastructure fine-tuning model-serving mlops scalability efficient-ai

์ˆ˜์ฒœ์–ต ๊ฐœ์˜ ํŒŒ๋ผ๋ฏธํ„ฐ๋ฅผ ๊ฐ€์ง„ ๊ธฐ์ € ๋ชจ๋ธ์„ ๋ณต์‚ฌํ•˜์ง€ ์•Š๊ณ , LoRA ์–ด๋Œ‘ํ„ฐ๋งŒ ํšจ์œจ์ ์œผ๋กœ ๊ด€๋ฆฌํ•˜์—ฌ ์ˆ˜๋ฐฑ๋งŒ ๊ฐœ์˜ ๋งž์ถคํ˜• ๋ชจ๋ธ์„ ํ•™์Šตํ•˜๊ณ  ์„œ๋น„์Šคํ•  ์ˆ˜ ์žˆ๋Š” ํ˜์‹ ์ ์ธ ๊ด€๋ฆฌํ˜• ์ธํ”„๋ผ๋ฅผ ์ œ์‹œํ–ˆ๋‹ค๋Š” ์ ์—์„œ ๋งค์šฐ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿฅˆ 2. Mean Mode Screaming: Meanโ€”Variance Split Residuals for 1000-Layer Diffusion Transformers

arXiv: 2605.06169 | โฌ†๏ธ 182 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: diffusion-transformer deep-learning training-stability gradient-analysis mv-split model-collapse generative-ai optimization

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿฅ‰ 3. SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

arXiv: 2605.12500 | โฌ†๏ธ 169 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: multimodal computer-vision nlp architecture unified-model generative-ai deep-learning

์ด ๋…ผ๋ฌธ์€ ๊ธฐ์กด์— ์„œ๋กœ ๋ถ„๋ฆฌ๋˜์–ด ์žˆ๋˜ ์‹œ๊ฐ ์ดํ•ด์™€ ์ƒ์„ฑ ๊ณผ์ œ๋ฅผ ํ•˜๋‚˜์˜ ๋„ค์ดํ‹ฐ๋ธŒ ํ†ตํ•ฉ ์•„ํ‚คํ…์ฒ˜(Native Unified Architecture)๋กœ ๊ฒฐํ•ฉํ•˜์—ฌ, ๋ชจ๋ธ์˜ ๊ตฌ์กฐ์  ํ•œ๊ณ„๋ฅผ ์—†์• ๊ณ  ์ง„์ •ํ•œ ๋ฉ€ํ‹ฐ๋ชจ๋‹ฌ ์ง€๋Šฅ์„ ๊ตฌํ˜„ํ–ˆ๋‹ค๋Š” ์ ์— ํ•ต์‹ฌ์ ์ธ ์˜์˜๊ฐ€ ์žˆ์Šต๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


4. 4. MemPrivacy: Privacy-Preserving Personalized Memory Management for Edge-Cloud Agents

arXiv: 2605.09530 | โฌ†๏ธ 140 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: ai-paper ml

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


5. 5. Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

arXiv: 2605.13301 | โฌ†๏ธ 137 โ†’ Deep Dive ๋ณด๊ธฐ ํƒœ๊ทธ: llm reasoning olympiad reinforcement-learning scaling mathematical-reasoning test-time-compute

๋ณธ ๋…ผ๋ฌธ์€ ๋ณ„๋„์˜ ๊ธฐํ˜ธ์  ์—”์ง„(symbolic engine) ์—†์ด๋„ ๊ฑฐ๋Œ€ ์–ธ์–ด ๋ชจ๋ธ(LLM)์„ ๋Œ€์ƒ์œผ๋กœ ๋‹จ์ˆœํ•˜๋ฉด์„œ๋„ ํ†ตํ•ฉ๋œ ํ•™์Šต ๋ฐ ์ถ”๋ก  ํŒŒ์ดํ”„๋ผ์ธ์„ ์ ์šฉํ•˜์—ฌ, ๊ตญ์ œ ์ˆ˜ํ•™ ๋ฐ ๋ฌผ๋ฆฌ ์˜ฌ๋ฆผํ”ผ์•„๋“œ์—์„œ ๊ธˆ๋ฉ”๋‹ฌ ์ˆ˜์ค€์˜ ๋ฌธ์ œ ํ•ด๊ฒฐ ๋Šฅ๋ ฅ์„ ๋‹ฌ์„ฑํ–ˆ๋‹ค๋Š” ์ ์—์„œ ๋งค์šฐ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.

๐Ÿ“– ์ƒ์„ธ ๋ถ„์„: โ†’ Deep Dive ๋ณด๊ธฐ์—์„œ ์‹ฌ์ธต ๋ถ„์„์„ ํ™•์ธํ•˜์„ธ์š”.


๐Ÿ“… ์ƒ์„ฑ์ผ: 2026-05-17 | ๐Ÿค– GLM-4.7 Weekly Digest