-
Less is More: Recursive Reasoning with Tiny Networks
Paper • 2510.04871 • Published • 507 -
Cache-to-Cache: Direct Semantic Communication Between Large Language Models
Paper • 2510.03215 • Published • 98 -
When Thoughts Meet Facts: Reusable Reasoning for Long-Context LMs
Paper • 2510.07499 • Published • 48 -
StreamingVLM: Real-Time Understanding for Infinite Video Streams
Paper • 2510.09608 • Published • 51
Jiwon Song
jiwonsong
AI & ML interests
Efficient AI | Ph.D Student @ SNU-VLSI
Recent Activity
authored
a paper
6 days ago
RelayGen: Intra-Generation Model Switching for Efficient Reasoning
liked
a model
7 days ago
dongwonjo/Llama-1-13B-BinaryMoS-E4
upvoted
a
paper
7 days ago
Squeezing Large-Scale Diffusion Models for Mobile