How AI Is Changing the Way We Judge Science

Science Artificial Intelligence Research Evaluation Academic Bias Australia Technology

Source: Nature•Original, Adapted by LoopRead

1050L

Apr 12, 2026

15-17

1129 words

6 min

How AI Is Changing the Way We Judge Science

nature.com

A Global Shift in How Science Is Measured

Recent national decisions signal a major change in how the world evaluates academic research. In 2023, Australia officially ended its old, expensive, and complicated system for rating scholars. New Zealand quickly followed this example. According to a plan released by Australia's federal Department of Education and research leaders, the goal is to find a more modern, data-driven approach. This new method is increasingly using artificial intelligence, or AI.

While AI promises a faster and fairer way to judge work, it also raises significant concerns in the academic world. Critics worry that AI might reinforce existing biases, lose the deep understanding that only human experts provide, and reduce complex research to simple numbers. The current methods used to assess research are under heavy criticism. Traditional systems like peer review involve other researchers evaluating a study before it is published. This process is valued for expert judgment, but it is slow, can be subjective, and may be influenced by personal friendships or loyalty to specific institutions.

Many national systems also rely heavily on numbers. These include counting how many papers a person has written in top journals, measuring how often other researchers cite a paper, and calculating a researcher's "h-index." This index combines a researcher's productivity with how often their work is cited.

Critics argue that this "publish or perish" culture encourages quantity over quality. It pressures researchers to choose safe, small studies instead of ambitious, high-risk projects. It also undervalues important work like teaching, student mentorship, and community service. The constant need to prove one's worth through these rigid numbers is a major source of stress for academics everywhere.

The Promise of Artificial Intelligence

AI-powered tools offer a potential solution to these problems. They can analyze massive amounts of publication data, citations, and grant awards much faster than any human committee. Supporters suggest AI could identify important research trends, spot new collaborations, and measure the real-world impact of science beyond simple citation counts. For example, AI could track how often research is mentioned in government policies, news stories, or patent applications. This provides a much broader picture of how science affects society.

Some institutions are already testing these advanced tools. AI algorithms can help match reviewers with the right manuscripts more effectively. They can also flag potential conflicts of interest and perform initial checks for statistical errors or plagiarism. This could free human experts to focus on the deeper, conceptual value of a research project instead of getting stuck on administrative details.

However, putting AI into research assessment is not a simple upgrade. A primary fear is that AI systems will retain and even worsen biases that exist in the historical data used to train them. If past evaluations favored researchers from certain schools, countries, or groups, an AI trained on that data might learn to favor those same patterns automatically. This could make it harder for early-career researchers, those at less famous universities, and scholars from the Global South to succeed.

Another major concern is the risk of "gaming" the system. If AI algorithms prioritize specific keywords, journal names, or collaboration networks, researchers might feel forced to change their work to fit these rules. Such changes could stifle genuine creativity and innovation. Furthermore, AI works by finding patterns and connections in data. It lacks the human ability to understand context, make ethical judgments, or appreciate methodological innovation that does not yet have a clear data trail. As one researcher noted, AI might be very good at identifying what was influential in the past, but it may struggle to recognize the first sign of a revolutionary idea that challenges old ways of thinking.

The authors of a 2023 article in Postdigital Science and Education highlight this critical point. They wrote about the need to maintain human judgment in an increasingly automated system. They argue that assessment must consider the broader social and educational goals of research, not just measurable outputs.

The Purpose of Academia and the Risk of Optimization

The debate over AI in research assessment touches on deep questions about the purpose of universities. Is the goal to efficiently produce predictable, quantifiable knowledge? Or is it to support a diverse ecosystem of inquiry that includes long-term, speculative, and interdisciplinary work? A system driven entirely by AI risks optimizing for the first goal while hurting the second.

Moving to a new system requires careful design and thoughtful planning. Experts suggest that any AI-assisted model must be transparent. Researchers should fully understand which data points and metrics the system uses. The algorithms should be regularly checked for bias, and there must be a clear way for humans to appeal or oversee decisions. The ideal system would likely be a hybrid: it would use AI to handle data-heavy tasks while reserving final judgments on significance, originality, and societal impact for expert panels.

The decisions by Australia and New Zealand highlight a global movement to reform how research is assessed. International initiatives, such as the Declaration on Research Assessment (DORA), advocate for evaluating research on its own merits rather than the reputation of the journal where it is published. The integration of AI tools could speed up this shift, but only if implemented with caution and ethical foresight.

The potential consequences of these changes are significant. Funding, promotions, and institutional prestige are all tied to assessment results. A flawed algorithmic system could distort research priorities on a massive scale. It might direct resources away from important but less quantifiable areas like the humanities, certain social sciences, and foundational blue-sky research.

High Stakes for the Future of Science

For early-career researchers, the stakes are particularly high. They operate in a highly competitive environment where assessment outcomes can make or break a career. An opaque AI system could add another layer of uncertainty and perceived unfairness to an already stressful profession. This makes it difficult for talented individuals to succeed without understanding the rules of the game.

In conclusion, the push toward data-driven research assessment is inevitable. AI will undoubtedly play a growing role in how science is measured and valued. The critical challenge for the academic community is not whether to use these tools, but how to govern their use effectively. The goal must be to harness AI's power for efficiency and insight without surrendering the essential human values of nuance, equity, and intellectual courage that drive science forward.

The transition plan mentioned by Australian officials seeks a "more modern" approach. Ensuring it is also a wiser and more just one will require ongoing vigilance and dialogue. We must remember that while machines can count, only humans can understand the true spirit of discovery.

References

Watermeyer, R., Phipps, L., Lanclos, D. & Knight, C. Postdigital Sci. Educ. 6, 446–466 (2023).

Join LoopRead to save your progress and track performance

Question 1 of 4