About AIQ
The growth and development of AI systems and benchmarks have been rapidly increasing, yet there is a disproportionately small amount of examination into the domains used to evaluate these systems. Even the domains that consider a larger scope of evaluation are often not generalizable, and the implemented AI systems are often not usable for different domains.
To address these issues, we propose a measure of machine intelligence that can be intuitively understood and effectively utilized. This measure utilizes new metrics for both the agents evaluated and the domains used in the evaluation.
The intelligence of a given system is determined by measuring its performance on a multi-domain test with measurable complexity, similarity, and diversity. The Artificial Intelligence Quotient (AIQ) structures these domain side measurements into a clear and consistent framework that can be utilized to measure an AI system’s intelligence.
These domain side measurements allow for the creation of an intelligence space. Once a test is located within the space, its accompanying performance metric can be used to effectively scale the location and measuring an agent's capacitance, that is, the agent's capability to achieve a certain understanding of the domain. An agent that achieves a higher understanding of the domain will be given a higher AIQ score than one that achieves a lower understanding.
Agent capacitance also scales with domain complexity. An agent that achieves a comparable understanding of a more complex domain will be given a higher AIQ score than an agent on a less complex domain. These measures and resulting AIQ scores are evaluated using several intuitive experiments. Once the measures are shown to be capturing what was intended, we construct test suites using the AIQ framework. These test suites are then be evaluated using known AI systems to validate the AIQ score is capturing an agent's intelligence.
Launched the AIQ website!
We launched the first iteration of the AIQ website. Human Testing coming soon!
Learn morePublished the AIQ paper
The full framework of AIQ is proposed in dissertation. This encapsulates all of the previously metrics into a final computable score for AI (and humans) systems.
SAIL-ON Program Conclusion
The end of the SAIL-ON program. While this is the end of our first program, we are currently looking into additional groups to continue the research effort.