Seungwoo Ryu

tryumanshow

AI & ML interests

LLM, Agent

upvoted 2 articles 11 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

+4

toslali-ibm, mirinflim, qgallouedec, esnible, rganti, mudhakar • Jun 3, 2025 • 101

Article

🐯 Liger GRPO meets TRL

+4

shisahni, kashif, smohammadi, ShirinYamani, m0m0chen, liberty4321 • May 25, 2025 • 53

upvoted a collection about 1 year ago

Reasoning Datasets

50 items • Updated Jun 8, 2025 • 11

upvoted an article about 1 year ago

Article

Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial

open-r1 • Jan 31, 2025 • 51

upvoted a collection about 1 year ago

🧠 Reasoning model 2025

19 items • Updated Jan 4 • 6

upvoted a paper over 1 year ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 45

upvoted an article over 1 year ago

Article

Fixing Gradient Accumulation

+4

lysandre, ArthurZ, muellerzr, ydshieh, BenjaminB, pcuenq • Oct 16, 2024 • 66

upvoted 3 collections over 1 year ago

Korean Instruction Dataset

5 items • Updated Jan 24, 2025 • 8

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 49 items • Updated Mar 2 • 141

Korean Reward Modeling

Korean Datasets, Reward Models for RLHF • 15 items • Updated Mar 2 • 3

upvoted a paper over 1 year ago

DiaSynth -- Synthetic Dialogue Generation Framework

Paper • 2409.19020 • Published Sep 25, 2024 • 20

upvoted an article over 1 year ago

Article

The 5 Most Under-Rated Tools on Hugging Face

derek-thomas • Aug 22, 2024 • 93

upvoted a collection over 1 year ago

LLMs

469 items • Updated Mar 29 • 42

upvoted 2 papers almost 2 years ago

Octo-planner: On-device Language Model for Planner-Action Agents

Paper • 2406.18082 • Published Jun 26, 2024 • 48

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 30

upvoted 2 collections almost 2 years ago

Function Calling v3

Models fine-tuned for function-calling • 12 items • Updated Mar 2 • 21

Agents

Collection of resources related to Agents. • 73 items • Updated Jan 28, 2025 • 6

upvoted 2 collections about 2 years ago

Miqu-based Models

A collection of creative writing models based on the 'miqu-1-70b' model. • 2 items • Updated Mar 2 • 2

Agents

63 items • Updated Jan 10, 2025 • 5

upvoted a paper about 2 years ago

Is Bigger Edit Batch Size Always Better? -- An Empirical Study on Model Editing with Llama-3

Paper • 2405.00664 • Published May 1, 2024 • 20