A lake of water stands at the foot of Austria’s receding Pasterze glacier this summer. Glaciers across Europe have been receding since approximately the 1870s, but the process has accelerated since the early 1980s, and many scientists attribute that phenomenon to global warming. In the United States, professors who worry that the administration of President-elect Donald J. Trump will be “willfully hostile toward the scientific process” are leading a campaign to preserve huge amounts of federal climate-science data.
Scientists, librarians, and digital historians from a growing number of universities have begun a crowdsourced effort to copy and archive thousands of federal-government websites and data sets related to climate change, the environment, and other areas of scientific research that they fear could become compromised or inaccessible under the incoming Trump administration.
The movement, now known by its hashtag, #DataRefuge, took off in response to an essay in The Washington Post in which a meteorologist and journalist named Eric Holthaus described how he had begun “to systematically catalog and preserve as much of the federal government’s publicly available climate-science data as possible in the next five weeks.”
We're sorry. Something went wrong.
We are unable to fully display the content of this page.
The most likely cause of this is a content blocker on your computer or network.
Please allow access to our site, and then refresh this page.
You may then be asked to log in, create an account if you don't already have one,
or subscribe.
If you continue to experience issues, please contact us at 202-466-1032 or help@chronicle.com
Sean Gallup, Getty Images
A lake of water stands at the foot of Austria’s receding Pasterze glacier this summer. Glaciers across Europe have been receding since approximately the 1870s, but the process has accelerated since the early 1980s, and many scientists attribute that phenomenon to global warming. In the United States, professors who worry that the administration of President-elect Donald J. Trump will be “willfully hostile toward the scientific process” are leading a campaign to preserve huge amounts of federal climate-science data.
Scientists, librarians, and digital historians from a growing number of universities have begun a crowdsourced effort to copy and archive thousands of federal-government websites and data sets related to climate change, the environment, and other areas of scientific research that they fear could become compromised or inaccessible under the incoming Trump administration.
The movement, now known by its hashtag, #DataRefuge, took off in response to an essay in The Washington Post in which a meteorologist and journalist named Eric Holthaus described how he had begun “to systematically catalog and preserve as much of the federal government’s publicly available climate-science data as possible in the next five weeks.”
Mr. Holthaus said he was acting because President-elect Donald J. Trump, who has called climate change a hoax perpetuated by the China, was “relentlessly converting” his anti-science rhetoric into action through nominations for key posts in an administration that is “likely to be willfully hostile toward the scientific process.”
Given the hostility toward climate science and environmental regulations shown by some of the president-elect’s nominees, some scientists say it’s prudent to take steps to ensure that vital information isn’t compromised.
Mr. Holthaus posted an online form that allows scientists and others to suggest specific data sets and websites to be archived. The form automatically puts the suggested material onto a spreadsheet that is openly accessible. Volunteers can also use the form to “claim a data set to rescue.”
“The spreadsheet went crazy,” says Bethany Wiggin, a professor at the University of Pennsylvania who directs its Program in Environmental Humanities.
ADVERTISEMENT
On Tuesday, the Penn program, whose professors and graduate students had already been discussing an effort of this sort, agreed to take over management of the spreadsheet. With support from Penn Libraries, the program will help coordinate and advance the fast-evolving project. Librarians and professors from several departments at the University of Toronto are also partners in the #DataRefuge effort.
On Saturday, the Toronto institution will play host to an event it calls a “guerrilla archiving,” during which professors will work with students and other volunteers on creating digital tools to improve the process of uploading, tagging, and archiving government data sets, as well as getting started on saving some “vulnerable” data. The Penn program will host a similar event next month. Academics at the two institutions say they are also making connections with scholarly and technology organizations like the Society of American Archivists, the Coalition for Networked Information, and the Internet Archive for additional expertise and volunteer help.
The Chronicle spoke Wednesday with Ms. Wiggin and with Patrick Keilty, an assistant professor at Toronto’s faculty of information, about the scope, challenges, and rationale for the project. Here are some answers to key questions about their efforts.
Why do this? Is this just some kind of anti-Trump political statement?
Both professors say they’re undertaking the project in a better-safe-than-sorry vein.
ADVERTISEMENT
“It’s not meant to be alarmist,” says Ms. Wiggin, but given the hostility toward climate science and environmental regulations shown by some of Mr. Trump’s nominees, she says it’s prudent to take steps to ensure that vital information isn’t compromised.
The worst-case scenario, says Mr. Keilty, would be outright deletion. But “even if the data isn’t deleted on Day 1,” he says, “it can slowly be made inaccessible by lack of maintenance.” Routine decisions, like federal departments’ remaking websites to reflect new priorities, can also mean “links go down” and key files go dead, he said. And for departments targeted for defunding, like the Environmental Protection Agency, he says new leaders there may not provide the labor or resources needed to maintain data.
“Access to government-funded research shouldn’t be a controversial issue. Evidence-based policy making shouldn’t be a controversial issue,” says Mr. Keilty. “It’s hard to create evidence-based policies if the evidence goes missing or becomes inaccessible.”
Ms. Wiggin says the #DataRefuge project fits within the purview of the “strong public-engagement component” of her program, while also acknowledging that the effort could be seen as “a political intervention.” But, she says, that doesn’t make it inappropriate. “Research happens in a social context. It doesn’t happen in a vacuum, Part of it is the political context.”
Any significance to the fact that a Canadian university is involved?
ADVERTISEMENT
Canadian academics are especially attuned to this issue, says Mr. Keilty, because in 2014, under former Prime Minister Stephen Harper, the government abruptly closed a number of well-respected natural-science libraries, to the alarm of many scientists and environmentalists. Mr. Harper was also viewed as being hostile to scientists. In the United States, Mr. Keilty added, the George W. Bush administration sought over a period of several years to close down the EPA library system.
Ms. Wiggin, who teaches courses on censorship as well those on the social impacts of climate change, says projects like the archiving effort make sense “if you’re somebody who teaches that history provides lessons.”
Since 2008, the Internet Archive has been working with partners to preserve federal websites and records as presidential administrations change, as part of the End of Term Web Archive. Doesn’t this duplicate that effort?
The Internet Archive uses tools known as web crawlers that can capture sites with URLs. Mr. Keilty says the #DataRefuge project will prioritize documents held in PDF files, Excel sheets, and other digital formats that web crawlers don’t pick up.
The effort is a giant lift, not just in terms of technology and digital storage space, but also manpower, data security, and resources. How is that being handled?
ADVERTISEMENT
Many of those questions are still being resolved. Project organizers say they’ll need secure storage for data measured in petabytes, and they’ll want to be sure the storage sites are dispersed. “More copies in more places is a better idea,” says Ms. Wiggin. Both professors say they’ve received offers from companies and other organizations for free and low-cost storage, as well as offers of financial support from people Ms. Wiggin calls “angel investors,” but for now the professors have declined to name them. Mr. Keilty says he’s also heard from professors, librarians, and others from at least a dozen other universities with offers of help, but doesn’t yet know how many of them will formally become involved or be identified. With so many librarians, archivists, and technologists showing interest, Ms. Wiggin says they are prepared to “really tap into a hugely active and knowledgeable community.”
What about data sets in other fields — say, the social sciences — where academics might fear political interference?
“We staked out climate and environmental data,” says Ms. Wiggin. But she adds that she wouldn’t be surprised if academics in other fields followed suit.
“I hope we inspire other people,” she says. “We can’t download the internet.”
Goldie Blumenstyk writes about the intersection of business and higher education. Check out www.goldieblumenstyk.com for information on her new book about the higher-education crisis; follow her on Twitter @GoldieStandard; or email her at goldie@chronicle.com.
The veteran reporter Goldie Blumenstyk writes a weekly newsletter, The Edge, about the people, ideas, and trends changing higher education. Find her on Twitter @GoldieStandard. She is also the author of the bestselling book American Higher Education in Crisis? What Everyone Needs to Know.