
Manual verification that the "judge" judges correctly.

Also, how exactly do you programmatically validate CVEs?



Most open-source CVEs link a patch in their disclosure. You can recover the vulnerable code from the git diff, then check whether the LLM's finding covers it.
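A rough sketch of that check, assuming the patch is available as a unified diff and the LLM's finding has been reduced to a file path plus a line range. The names `vulnerable_ranges`, `finding_matches`, and `SAMPLE_DIFF` are made up for illustration, and the sample patch is invented, not a real CVE fix. Hunk ranges include context lines, so this only approximates "the finding points at the patched code".

```python
import re
from collections import defaultdict

# Matches a unified-diff hunk header and captures the pre-patch start line and length.
HUNK_RE = re.compile(r"^@@ -(\d+)(?:,(\d+))? \+\d+(?:,\d+)? @@")


def vulnerable_ranges(diff_text):
    """Map each pre-patch file path to the (start, end) line ranges its hunks touch.

    Simplification: any line starting with '--- ' is treated as a file header,
    so a removed line whose content starts with '-- ' would confuse this parser.
    """
    ranges = defaultdict(list)
    current = None
    for line in diff_text.splitlines():
        if line.startswith("--- "):
            path = line[4:].strip()
            # git prefixes the old path with 'a/'; strip it if present.
            current = path[2:] if path.startswith("a/") else path
        elif current and (m := HUNK_RE.match(line)):
            start = int(m.group(1))
            length = int(m.group(2)) if m.group(2) is not None else 1
            if length > 0:  # length 0 means the hunk only adds lines to an empty range
                ranges[current].append((start, start + length - 1))
    return dict(ranges)


def finding_matches(file, start, end, ranges):
    """True if the finding's line range overlaps any patched range in the same file."""
    return any(s <= end and start <= e for s, e in ranges.get(file, []))


# Invented example patch (assumption, for illustration only).
SAMPLE_DIFF = """\
--- a/src/parse.c
+++ b/src/parse.c
@@ -40,3 +40,4 @@ int parse(const char *src)
     char buf[64];
+    if (strlen(src) >= sizeof buf) return -1;
     strcpy(buf, src);
     return handle(buf);
"""

if __name__ == "__main__":
    patched = vulnerable_ranges(SAMPLE_DIFF)
    print(patched)
    print(finding_matches("src/parse.c", 41, 41, patched))
```

This only tells you the finding landed in the right place. Whether the LLM described the right bug there is still a judgment call, which is where the "judge" (and manual spot checks of it) comes back in.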



