I mean isn't it like the uncanny valley? it's sort of like "the real thing" aka human created artworks that we traditionally love and connect with, but just far enough that it gives us even more disgust that something completely not human created(like an inanimate object) would.
Yes but with the "judgement" to call them. If you put "review the results based on conditions described here and anything else suspicious you may spot before call the <next_deterministic_program>", it should be able to catch some case you didn't think about in your standard checks. Of course it may miss out on those or have false positives but that is the nature of the beast, as it is now.