- Challenge advanced language models on mathematics topics.
- Verify factual accuracy and logical soundness.
- Suggest improvements to prompt engineering and evaluation metrics.
1 open remote positions
Project is shaping the future of AI. Large-scale language models are evolving from clever chatbots into powerful engines of scientific discovery.