Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

it's such a misinformed statement, as kimi2.5 was used as a base model for composer 2 and then heavily RLed


What does heavy RL even mean…similar to how the CEO of cursor said how much better the perplexity got when it’s a terrible metric for model fine tune performance? Let’s be real here, it’s Kimi 2.5 fine tuned for Cursor. There’s nothing wrong with that but they tried to hide it and it’s some work they put in but nothing close to training a model of their own.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: