TuringBox: An Experimental Platform for the Evaluation of AI Systems

TuringBox: An Experimental Platform for the Evaluation of AI Systems

Ziv Epstein, Blakeley H. Payne, Judy Hanwen Shen, Casey Jisoo Hong, Bjarke Felbo, Abhimanyu Dubey, Matthew Groh, Nick Obradovich, Manuel Cebrian, Iyad Rahwan

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence

We introduce TuringBox, a platform to democratize the study of AI. On one side of the platform, AI contributors upload existing and novel algorithms to be studied scientifically by others. On the other side, AI examiners develop and post machine intelligence tasks to evaluate and characterize the outputs of algorithms. We outline the architecture of such a platform, and describe two interactive case studies of algorithmic auditing on the platform.
Keywords:
Multidisciplinary Topics and Applications: AI and Social Sciences
Multidisciplinary Topics and Applications: Human-Computer Interaction
Natural Language Processing: NLP Applications and Tools
Multidisciplinary Topics and Applications: Multidisciplinary Topics and Applications