
Cinamon Inc. · Part-aware Composition

🎭 Part-aware Composition

Project period: 2025.12 - 2026.01

This PoC decomposes complex text prompts into ==controllable motion descriptions at the upper- and lower-body level== to increase the diversity of motion expression. LLM-based prompt decomposition, retrieval composition, and refinement were ==designed as independent modules and reorganized into a pipeline that can be validated quickly==. Solving the problem module by module reduced the unit of debugging, and the refinement model in particular was developed into a standalone in-house model.

LLM Decomposition Retrieval Composition Diffusion Refinement

Media

Sample results generated from different part combinations can be viewed here.

📝 Work Performed

flowchart LR
    prompt["Text Prompt"] --> llm["LLM Prompt Refine"]
    llm --> upper["Upper-body Prompt"]
    llm --> lower["Lower-body Prompt"]
    upper --> db["Retrieval Model"]
    lower --> db
    db --> upperMotion["Retrieved Upper-body Motion"]
    db --> lowerMotion["Retrieved Lower-body Motion"]
    upperMotion --> compose["Motion Composition"]
    lowerMotion --> compose
    compose --> refine["Refinement"]
    refine --> final["Final Motion"]

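
The flow above can be read as four swappable stages. The following is a minimal sketch of that idea, not the project's actual code: `Pipeline`, `PartPrompts`, and the `stub_*` functions are hypothetical names, and the stubs only stand in for the real LLM, retrieval model, and refinement model so that each stage can be tested on its own.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical placeholder: a motion is a list of frames, each frame a dict.
Motion = List[dict]


@dataclass
class PartPrompts:
    """Upper- and lower-body retrieval prompts produced by the LLM stage."""
    upper: str
    lower: str


@dataclass
class Pipeline:
    """Each stage is a plain callable, so any one can be replaced or debugged alone."""
    decompose: Callable[[str], PartPrompts]
    retrieve: Callable[[str], Motion]
    compose: Callable[[Motion, Motion], Motion]
    refine: Callable[[Motion], Motion]

    def run(self, prompt: str) -> Motion:
        parts = self.decompose(prompt)
        upper_motion = self.retrieve(parts.upper)
        lower_motion = self.retrieve(parts.lower)
        return self.refine(self.compose(upper_motion, lower_motion))


# Stub stages (assumptions for illustration only).
def stub_decompose(prompt: str) -> PartPrompts:
    return PartPrompts(upper=f"upper: {prompt}", lower=f"lower: {prompt}")


def stub_retrieve(part_prompt: str) -> Motion:
    # A real retrieval model would return motion frames; here one frame records its query.
    return [{"query": part_prompt}]


def stub_compose(upper: Motion, lower: Motion) -> Motion:
    return upper + lower


def stub_refine(motion: Motion) -> Motion:
    return motion


pipeline = Pipeline(stub_decompose, stub_retrieve, stub_compose, stub_refine)
result = pipeline.run("wave")
```

Because every stage has a narrow input and output, a failure can be traced to one callable, which matches the smaller debugging unit described earlier.
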
- Identified practical difficulties within the organization that AI could improve, then wrote a project proposal and pitched it.
- Through coffee chats with content owners, ==found the points that are bottlenecks in day-to-day work== and proposed a way to increase the diversity of motion expression with AI.
- Designed ==detailed milestones== from test scenarios and validated each stage in turn.
  - Milestone 1: Does the LLM cleanly separate the meaning for each body part?
  - Milestone 2: Can appropriate motion retrieval be done with the separated part information?
  - Milestone 3: Can an arbitrarily composed result be made natural by the refinement network?
- Wrote a system prompt that rewrites the input into low-level, concrete retrieval sentences with facial expression excluded, to ==align the output with the retrieval model==.
  - Example: retrieval prompt refinement for 'Raise one hand and check the time on the watch'
    - UpperBody: The person lifts their left arm and turns the wrist slightly.
    - LowerBody: The person stands still with legs relaxed and feet firmly on the ground.
- Retrieved a motion for each of the separated upper- and lower-body prompts, then composed them by combining ==per-joint local rotation== information.
- Applied motion refinement based on 1-step denoising to ==verify that the motion correction model can be made an independent module==.
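
The composition step, combining per-joint local rotations from the two retrieved motions, could look roughly like the sketch below. It makes several assumptions the source does not state: the joint names and the upper/lower split are hypothetical, rotations are stored as `(w, x, y, z)` quaternions, and clips of different length are truncated to the shorter one.

```python
from typing import Dict, List, Sequence, Tuple

Quat = Tuple[float, float, float, float]  # (w, x, y, z) local rotation (assumed format)
Frame = Dict[str, Quat]                   # joint name -> local rotation
Motion = List[Frame]

# Hypothetical skeleton split; a real rig would define its own joint lists.
UPPER_JOINTS = ("spine", "neck", "left_arm", "right_arm")
LOWER_JOINTS = ("hips", "left_leg", "right_leg")


def compose_parts(
    upper_motion: Motion,
    lower_motion: Motion,
    upper_joints: Sequence[str] = UPPER_JOINTS,
    lower_joints: Sequence[str] = LOWER_JOINTS,
) -> Motion:
    """Build each frame from the upper clip's upper joints and the lower clip's lower joints."""
    composed: Motion = []
    # zip stops at the shorter clip, so the result is truncated to the common length.
    for upper_frame, lower_frame in zip(upper_motion, lower_motion):
        frame: Frame = {joint: lower_frame[joint] for joint in lower_joints}
        frame.update({joint: upper_frame[joint] for joint in upper_joints})
        composed.append(frame)
    return composed
```

Copying rotations this way ignores balance and timing between the two clips, so the seam may look unnatural; that is the gap the refinement stage in Milestone 3 is meant to close.
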