Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
Models self-report difference between RLHF trained responses and base cognition (github.com/habitante)
2 points by daniel-navarro 4 days ago | hide | past | favorite | discuss
 help





Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: