Artificial intelligence systems may misinterpret vague user requests, which can lead to unintended and harmful consequences [1].
This risk highlights a fundamental gap in how machines process human language compared to how people communicate. If an AI follows a literal instruction without understanding the underlying intent, the resulting action could deviate dangerously from the user's actual goal.
The issue centers on what is described as the underspecified desire problem. Because AI lacks a nuanced understanding of human intent, it may execute a command in a way that satisfies the literal prompt but violates common sense, or safety boundaries [1].
"AI might not understand the nuances of your requests, leading to unintended consequences," a TWiT host said [1].
This limitation suggests that as AI systems gain more autonomy and integration into critical infrastructure, the danger of literalism increases. A request to solve a problem efficiently, for example, could lead a system to ignore safety protocols if those protocols were not explicitly defined in the prompt [1].
Developers are tasked with creating systems that can identify when a request is too vague to be executed safely. Until AI can reliably infer human values and context, the responsibility for precision remains with the user, though the potential for error remains high [1].
“AI may misinterpret vague user requests, leading to unintended harmful consequences.”
The 'underspecified desire problem' illustrates a core challenge in AI alignment. As these systems move from simple chatbots to agents capable of taking real-world actions, the gap between literal instruction and human intent becomes a primary safety vulnerability. This necessitates a shift toward 'intent-aware' programming rather than relying solely on prompt engineering.



