
What if accountability is the ultimate online-learning hack? As in: the AIs participate in society, etc., and are punished for bad behavior just like we are, thereby getting better.

Or do we just not call that "accountability"?



How do you propose that punishment make the AIs "get better"? From their perspective they're already as good as they can be, based on their training.


Reinforcement learning can train a model against a reward function. The suggestion is that real-world accountability could be translated into such a reward function.

Also, OP explicitly mentioned "online learning", which is a continuous training process after standard pre-training.

For what it's worth, I don't think this would work. Rewards would come in too sporadically to be useful.


I think it's an interesting hypothesis to test. We should hold individual legal persons responsible for the actions of their AI, and those people can test the validity of the hypothesis by incorporating their personal consequences into the reward function. Email from legal: -10, $10k fine: -1000, 1 year in prison: -100000.
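
To make that concrete, here is a rough sketch of what such a penalty table might look like in Python. The numbers are the ones from this comment; the event names and the `accountability_reward` function are made up for illustration and don't come from any real RL library.

```python
# Hypothetical penalty table: the values are the ones quoted above,
# the event names are invented for this sketch.
PENALTIES = {
    "email_from_legal": -10,
    "fine_10k": -1_000,
    "prison_1_year": -100_000,
}


def accountability_reward(events):
    """Sum the penalties for the consequences observed in one episode.

    `events` is a list of event names; an episode with no consequences
    yields a reward of 0. Unknown event names raise KeyError.
    """
    return sum(PENALTIES[event] for event in events)


if __name__ == "__main__":
    print(accountability_reward([]))
    print(accountability_reward(["email_from_legal", "fine_10k"]))
```

Most episodes would presumably pass an empty list and get a reward of 0, which is the sparsity problem raised upthread.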



