Lorenz Kuhn
University of Oxford
Why do you care about AI Existential Safety?
It seems plausible that we will develop highly capable AI systems in the near future. While AI has the potential to have a positive impact on the world, it also has the potential to cause significant harm if not developed responsibly. Even under relatively weak assumptions about future AI systems, it is likely that they will be more powerful than humans in some ways. If those AI systems are not sufficiently aligned with humans, this might lead to dangerous and unpredictable outcomes.
Please give at least one example of your research interests related to AI existential safety:
- Scalable oversight, in particular the automatic evaluation of large language models.
- Uncertainty estimation in large language models.
- Generalization in deep learning.