[Generative AI with LLM][Reinforcement learning from human feedback, LLM-based Application]

2025.08.05

[Reinforcement learning from human feedback : RLHF] LLM은 유해한 data를 반영하기도 함 -> ...

관련 포스팅

Copyright blog.dowoo.me All right reserved.