Understand AI Psychology without Assuming Human-Like Psychology
Observing emergent AI decision-making processes and cognitive patterns with fewer anthropomorphic assumptions.
Resources (1)
Role play with large language models
Technology Seed
R&D Gaps (1)
The potential for AI systems to behave unpredictably or dangerously (“go rogue”) is a critical concern. Ensuring safe and controllable AI architectures is essential for reliable operation.
See also:
• https://www.lesswrong.com/posts/fAW6RXLKTLHC3WXkS/shallow-review-of-technical-ai-safety-2024
• h...