Can Artificial Intelligence (AI) Benefit Natural History Collections?

Or: How a Comment About iNaturalist Led Me Down an AI Rabbithole

By Erin Cashion, Curator of Natural History

Rather than an exhaustive report, this blog is intended to give a brief overview of the topic and jumping off points for further investigation.

This blog was written without the assistance of AI writing tools.

While identifying a butterfly on a social media post (an example of the species in question is below – Two-tailed Swallowtail, Papilio multicaudata), a commenter asked what guide I had used to make the identification. I had used iNaturalist, which is “an online social network of people sharing biodiversity information to help each other learn about nature”. Everyone who uses iNaturalist (or its kid-friendly version Seek) becomes a Natural History curator by cataloging the tree of life around them – a monumentally important task in the current 6th extinction – so I try to encourage people to use it whenever I can.

Two-tailed Swallowtail, Papilio multicaudata. This species is found west of the Rockies. Photo by Alan Schmierer, taken March 21, 2012 near Patagonia, AZ. Shared under Creative Commons CC0 1.0 Universal Public Domain Dedication, https://creativecommons.org/publicdomain/zero/1.0/

A yellow swallowtail butterfly with black stripes on a pink milkweed blossom.

The commenter replied: “I have a bit of distrust for AI identifications.”

This response was not unexpected or surprising. Artificial Intelligence (AI) has increasingly been in the news, and not much of the news has been positive. Recently, a non-peer reviewed preprint from MIT found that students who used ChatGPT (an AI chatbot) to compose reports had less brain connectivity and poorer recall than students who were only allowed to use their brains to write the reports. The many articles discussing this preprint suggested that the use of AI could be making us, to oversimplify, less intelligent. (But the preprint had another surprising finding that was not mentioned1.)

The idea that AI will bring about the end of civilization as we know it (one way or another) is widely accepted.

The concept of “artificial intelligence” has been in the social consciousness for over 200 years. Cautionary tales about the consequences of humans creating artificial life have been a staple in literature and film since Mary Shelley wrote Frankenstein in 1818, in which a fabricated "monster" with an inherently peaceful nature is ultimately driven to violence after being repeatedly rejected by humans (including his own creator). The term “robot” first entered the lexicon in the 1920 play R.U.R. by Karel Čapek, in which artificial workers rebel against their masters and bring about the extinction of the human race. (The word “robot” comes from Czeck robota, “forced labor”.) This theme2 has been repeated in numerous sci-fi literary and cinematic classics that people have consumed for generations, myself included; I’m of the Terminator generation.

AI in our non-fictional lives is a relatively recent phenomenon. In November 2022, the private company OpenAI launched ChatGPT, its AI chatbot, and two months later it already had 100 million active users. AI-generated art, uncanny “deepfakes”, and other AI-generated videos almost indistinguishable from real footage are now ubiquitous on social media. In late June, a new band appeared on Spotify and garnered a sizeable following in a matter of days. The band? Entirely AI generated. Many apps and other tech we use every day (like iNaturalist, Microsoft Teams, Google, and our cars!) use some kind of computer-aided algorithm to assist our work in various ways. We even have slurs for AI bots now – "clanker" being one (but as a good friend said, I'm pretty annoyed that "toaster" isn't.

Is there any good news? Aside from iNaturalist, are there other ways that AI tools are being used to benefit Natural History Collections and wildlife conservation?

The answer is unequivocally yes. Within the past few years, the Florida Museum of Natural History recently filled among the first curatorial positions for AI in the fields of Natural History and Archaeology, where AI is being used to inform how organisms evolve by aggregating the data that would take a human two hundred 8-hour days to collect in a single hour – saving staff time and institution funding.

AI is being used to benefit wildlife conservation by analyzing mountains of data in a fraction of the time it would take humans to do. AI-assisted game cameras are being used to alert local farmers of the presence of tigers in India, and a researcher at the University of Alberta is using AI to analyze thousands of hours of recordings of nighthawk calls to learn how and when they use their foraging vs. nesting habitats (see this article from Yale).

There are also potential and proposed ideas for how AI could be used to benefit Natural History collections and the curators who care for them.

Detailed habitat descriptions often accompany natural history specimens. These descriptions can include information about geology, hydrology, and other species found at the place of collection. These descriptions could be analyzed with AI tools (such as Machine Learning, a subcategory of AI that uses algorithms to find patterns in data) and used to train predictive models with georeferenced records. These models could reconstruct the historic range of species, predict where species might be found but have not been documented, update species range maps, and be used to document the spread of invasive species (see this article in Trends in Ecology & Evolution). Machine Learning could also be used to accelerate the analysis of mountains of data to predict the conservation status and extinction risk of rare and endangered species and update their listing on the IUCN Red List in a fraction of the time humans can do it (see this post from the Natural History Museum in London.)

The above are only a few examples, but there are many others. I invite the reader to investigate further!

In other AI-adjacent news, biotechnology company Colossal BioSciences announced in April 2025 they had brought the Direwolf back from extinction, and they used AI to do it. More recently, they announced their intention to bring the South Island Giant Moa (Dinornis robustus) back from extinction as well – only one of many species of moa, extinct flightless birds endemic to New Zealand. This goal is orders of magnitude more ambitious and difficult than what they accomplished with the wolves (and likely impossible). The announcement was met with skepticism and opposition by ecologists, geneticists, and Indigenous groups. Peter Jackson, The Lord of the Rings motion picture trilogy director (and keeper of a large private collection of moa specimens) is helping to fund the endeavor.

Whether de-extincting the moa would count as a “benefit” to Natural History remains to be seen – but if it happens, AI will have made it possible.

Additional photo details: This Upland Moa (Megalapteryx didinus) is one of the coolest existing specimens of moa (in my opinion) but is NOT the species Colossal Biosciences wants to “de-extinct”. A-B, Head and neck from left (A) and right (B) side. C-D, Right lower leg in medial (C) and lateral view (D).

However, there are endless examples of how AI is being used to the detriment of natural history (and society at large).

A lot of AI-generated material on social media is “clickbait”, provocative content that with little substance that is made to drive post engagement, or “ragebait”, inflammatory content that often stokes political division (which also drives post engagement). ChatGPT is being used for a variety of tasks that are questionable at best – the reader is probably already aware of a few examples, and likely has already used ChatGPT for one reason or another. AI-generated images of fictional plants and animals are shared on social media with identifications of actual species (my personal pet peeve), which I feel only widens the separation of people from nature. The internet is now rife with websites constructed with AI writing tools that contain inaccurate information about the natural world. Search engines will prioritize information from the most popular websites, even if they are AI-generated and incorrect, making the task of searching for factual information even more difficult. Foraging for wild foods has recently become popular, but the information regarding edibility of wild plants and fungi supplied by a search engine can be dangerous if it’s not correct. ChatGPT is being used to trawl the internet and churn out books for sale on Amazon written by fabricated authors that are full of false information.

There are two major potential issues with AI that deserve serious consideration and need to be addressed: energy use, and water use.

Water and energy use by data centers have been a concern for several years, particularly when they are built in already water-stressed areas; the use of AI tools is poised to exacerbate these existing issues. Rather than summarize these complex issues here, I recommend readers read this article from Yale and this article from MIT. The short version is that the alarm has sounded on water and energy use by AI, but at the time of this posting, the metrics being reported are extrapolations – but that is largely because either actual measurements are not being taken, or the data is not being disclosed2. In any case, energy use and water use by AI are serious concerns that need to be addressed.

N 1528. Asclepias sullivantii (Sullivant’s Milkweed). Collected June 34, 1926 by Edward S. Thomas in Pitt Township, Wyandot County. Housed at the Ohio History Connection, Columbus, Ohio. Photo credit: Erin Cashion, Ohio History Connection.

A white herbarium sheet of a dried pressed milkweed plant in flower with large rounded leaves. The color from the specimen has faded, but the label states the flowers were deep magenta in life.

Regardless, the genie is out of the bottle. Like any tool, AI can cause harm when used irresponsibly, but it has a vast potential for positive uses too. I am currently reorganizing our pressed plant collection according to the current taxonomy – like this specimen of Asclepias sullivantii (Sullivant’s Milkweed), which has been reassigned from Asclepiadaceae to Apocynaceae. As I'm contemplating the process of updating their digital catalog records (tediously typing in the data into the corresponding fields for each specimen one-by-one), I can’t help but wonder 1) how much time would be saved if all the data on the handwritten labels could be scanned by an AI-assisted program and deposited into a spreadsheet?  2) How much energy would that use, and how much water? 3) Is that less water and energy than I would use doing this task over the course of months instead of hours?

The identification of organisms requires both theoretical and practical knowledge; some argue that AI will never be able to learn in this way.

While it is true that iNaturalist uses a subset of AI tools (Computer Vision coupled with Machine Learning) to identify organisms, calling it an “intelligence” is a stretch. iNaturalist's "AI camera" is powered by a computer model “looks at” thousands of photos of organisms, analyzes them to recognize patterns in the images, and applies those patterns to new photos. Every new picture adds to the analysis and improves the accuracy of the computer model. But using iNaturalist’s AI-assisted camera and identifications are completely optional; users can assign their own IDs or post an organism without any ID at all, and leave identification up to other users, if they wish. (iNaturalist observations are not considered “Research Grade” until it has at least two congruent IDs.)

When I use iNaturalist’s AI-assisted identifications, it usually offers several options with a “% match” alongside. Depending on the quality of the photo, iNaturalist’s top suggestion may not be correct, but it is usually one of the alternatives provided. I cross-check its conclusions by looking up the scientific names in the appropriate field guides or online resources (or, since I have the privilege of stewarding a natural history collection, I can check real specimens), and then I compare field marks to the photo I’m trying to identify. Although iNaturalist takes range into account when making IDs, I also consult range maps. In some cases I may consult colleagues or other experts. Finally, once I decide on an ID and upload my observation, the human iNaturalist community checks it, often offering comments to support their argument if they disagree. In fact, having humans check and confirm IDs helps improve the accuracy of the identification models.

In other words, I’m letting “AI” do the heavy lifting to save me some time… but I’m still going to check its work.

 

Footnotes

  1. This preprint also reported that the people in the non AI-assisted group for their writing task showed significant increases in brain connectivity and had better recall after they were switched to the ChatGPT-assisted group. This seems to suggest AI may boost human cognition if it’s used as a post-hoc tool, rather than a load-bearing support beam. However, this study hasn’t been peer reviewed and had a small sample size, so I would like to see it repeated with more people!
  2. To be fair, there have been plenty of stories about "good robots", too: Wall-E, Short Circuit, Data of Star Trek (but not his "evil" brother Lore!), and many droids from the Star Wars universe come to mind; and even in Terminator 2: Judgement Day, the "bad robot" from the first film was reprogrammed to be a "good robot". These "good robot" stories often ask the viewer to question whether the robots are alive/sentient/deserving of full personhood – another theme that saturates the social consciousness: If we create artificial life, does it deserve rights?
  3. Google has disclosed what percentage of their energy processes are used for Machine Learning (ML). One study found that the energy use impact (measured in CO2 emission equivalents) was 300x greater for smartphones than that for ML. This was due to inefficiencies in energy use in using the smartphone’s processes for ML, vs. a centralized data center that is optimized for efficiency. The study also found that energy loss from smartphone chargers can be four times greater than the energy needed to run the smartphone itself. (Wireless chargers are particularly wasteful and inefficient.) Given the ubiquity of cell phones, this may suggest energy wastage by charging them is of greater concern than energy use by AI processes. (Others may argue it smacks of “the responsibility of environmental stewardship lies with individuals, not with corporations”.)

Subscribe to Our Blogs