The premier venues for artificial intelligence research are currently facing an unprecedented surge in academic submissions. Over the past decade, submissions to several leading conferences have increased more than tenfold. This massive influx creates a monumental challenge for the traditional peer-review system, which relies on a limited pool of volunteer experts to evaluate every submission with care. To address this overwhelming volume, one major conference has pioneered a novel strategy: asking the authors themselves to rank the quality of their own papers.
This initiative, introduced by the Neural Information Processing Systems (NeurIPS) conference, requires authors to submit a confidential self-assessment alongside their research manuscripts. This evaluation asks authors to position their work within a detailed taxonomy of contribution types and to rate its potential impact on the field. Proponents argue that this method offers reviewers crucial contextual information, which could help streamline the initial sorting of thousands of submissions. However, critics voice concerns that this approach may encourage strategic over-claiming rather than fostering genuine self-reflection among researchers.
Quantitative data underscores the magnitude of the challenge facing the field. NeurIPS, a cornerstone event in the discipline, received over 17,000 paper submissions in a recent year. This represents a staggering increase from the approximately 1,500 submissions received a decade prior. Other flagship conferences, such as the International Conference on Machine Learning (ICML) and the Conference on Computer Vision and Pattern Recognition (CVPR), report similar exponential growth. This volume vastly exceeds the capacity of the available reviewer corps, leading to rushed evaluations, severe reviewer burnout, and inconsistencies in the quality of feedback provided to authors.
The pressure on the system is not merely a matter of numbers. The scope and technical sophistication of AI research have also expanded dramatically. Modern papers now span foundational theory, novel algorithms, sophisticated engineering applications, and critical analyses of AI's societal implications. This diversity makes it increasingly difficult for any single reviewer, or even a small committee, to possess the deep expertise required to judge every submission fairly across its specialized domain. The sheer variety of topics means that finding experts for every unique angle of a paper is becoming nearly impossible.
The self-assessment protocol implemented by NeurIPS is structured and granular. Authors are not simply asked if their paper is "good." Instead, they must categorize their work using a predefined framework with specific dimensions. Key categories include:
Conference organizers emphasize that these self-assessments are confidential and are provided only to the assigned reviewers and senior area chairs. They are intended as a supplementary tool, not a replacement for expert critique. "The goal is to give reviewers a clearer starting point," explains a NeurIPS program chair. "When a reviewer sees a paper self-identified as a solid but incremental engineering improvement, they can calibrate their expectations differently than for a paper claiming a foundational theoretical breakthrough."