These are well differentiated areas of concern, and points towards a recognizable feature landscape, particularly I can imagine Ambiguity Aversion and Social Conformity being features not bugs with proper supplemental control mechanisms. We need maximum attention to such control mechanisms in the face of capabilities explosion.
We don’t want to naively mimic human decision-making, Homo Behavioralis, which is the origin of all these in Machina Behavioralis. We need to make something wholly better that drags human decision-making along, without kicking and screaming, and we are only beginning to define what that it is.
Even if we could get close to Machina Economicus that would only address 1% of alignment issues, as what goals should it rationally pursue? Humanity should figure out analog collective intelligence before accelerating half-assed AI experiments towards it, but we aren’t going to that; and more narrow attempts at “AI alignment” is all we have to, perhaps vainly, attempt to mitigate the coming disaster.
These are well differentiated areas of concern, and points towards a recognizable feature landscape, particularly I can imagine Ambiguity Aversion and Social Conformity being features not bugs with proper supplemental control mechanisms. We need maximum attention to such control mechanisms in the face of capabilities explosion.
We don’t want to naively mimic human decision-making, Homo Behavioralis, which is the origin of all these in Machina Behavioralis. We need to make something wholly better that drags human decision-making along, without kicking and screaming, and we are only beginning to define what that it is.
Even if we could get close to Machina Economicus that would only address 1% of alignment issues, as what goals should it rationally pursue? Humanity should figure out analog collective intelligence before accelerating half-assed AI experiments towards it, but we aren’t going to that; and more narrow attempts at “AI alignment” is all we have to, perhaps vainly, attempt to mitigate the coming disaster.