Preserving Access to Government Scientific Data under the Trump Administration

MFSThe inaugural March for Science was organized to coincide with Earth Day on April 22, 2017.  Tens of thousands of people rallied in Washington, DC and over 600 other locations across the globe. The organizers were motivated by what they saw as the Trump administration’s hostility to science on a variety of issues. These ranged from Trump’s own statements denying climate change, to the anti-science posture of officials appointed to key federal agencies such as the Environmental Protection Agency (EPA) and Energy Department, to proposed funding cutbacks at the National Institutes of Health.

From the time Trump was elected president, scientists and their allies have had serious concerns about how this would affect the federal government’s policies on science. One of their deepest fears is losing access to the vast amount of scientific data maintained by the federal government and made available to the public. The Obama presidency was committed to open data.  Under the Obama administration, “increasing access to scientific data and research findings generated by Federal agencies or resulting from Federally funded research” was viewed as a policy priority.  This policy has been reversed since Trump took office.

For example, Victoria Herrmann, an Arctic researcher, reported that the U.S. National Strategy for the Arctic, Implementation Plan for the Strategy, and reports about progress, disappeared from government websites the day after Trump’s inauguration. She stated that the months that followed have been “transformed into a slow, incessant march of deleting datasets, webpages and policies about the Arctic.” Other reports indicate that while outright deletion remains relatively uncommon, the new administration is making data harder to find, and will soon be cutting funding to the point where collecting data becomes difficult for federal agencies.

In response to these concerns, scientists, programmers, librarians, academics, and others have gathered in locations across North America at “data rescue” events organized by groups such as Data Refuge and the Environmental Data and Governance Initiative (EDGI).  EDGIThese gatherings are essentially hackathons during which participants endeavor to download, save, and archive scientific data maintained by the government.  Event organizers face continuing technical and logistical challenges, including: developing broadly-applicable tools that can be used to extract data from a variety of government websites, confirming the integrity of the data, securing sufficient long-term storage, documenting the chain of custody, and developing procedures to facilitate future public access. 

Amidst the ongoing data rescue efforts, it was widely reported that in the event of a government shutdown, on April 28, 2017 the EPA would take down their Open Data portal that provides data on climate change, pollution, and public health. The EPA subsequently responded that the portal would not be updated, but would not go dark in the event of a shutdown. This episode nonetheless raised an important question: can federal government agencies like the EPA simply delete data or deny public access to the data they maintain?

On this issue, federal agencies are constrained by statutory and administrative regulations. The Paperwork Reduction Act states that government agencies must “provide adequate notice when initiating, substantially modifying, or terminating significant information dissemination products.” 44 U.S.C. § 3506(d)(3).  The Federal Records Act (FRA) further limits how federal agencies can dispose of data. The FRA broadly defines records to include “all recorded information, regardless of form or characteristics, made or received by a Federal agency under Federal law or in connection with the transaction of public business and preserved or appropriate for preservation by that agency or its legitimate successor as evidence of the organization, Nara-Logofunctions, policies, decisions, procedures, operations, or other activities of the United States Government or because of the informational value of data in them.”  44 U.S.C. § 3301(a)(1)(A). The statute requires permanent records, i.e. those of continuing value, to be preserved and deposited in the National Archives and Records Administration (NARA). NARA has provided administrative guidance on how scientific, environmental, and technical data can be appraised and preserved. Notably, on December 22, 2016, NARA sent all federal agencies a memorandum on preservation of federal records that stated: “In many cases, websites contain databases or datasets. We remind agencies that such data, or the systems in which they reside, must be scheduled as Federal records.”

It is still an open question as to whether these laws have any teeth. Patrice McDermott, the author of Who Needs to Know? The State of Public Access to Federal Government Information (2007), recently stated that while the Federal Records Act makes it an offense to knowingly and arbitrarily destroy government records, “No one — NO ONE, period! – has ever been prosecuted for doing it.”  Moreover, in Kissinger v. Reporters Committee, 445 U.S. 136 (1980), the U.S. Supreme Court held that the Federal Records Act was created to benefit the federal government and its agencies, ruling that the statute contains neither an express nor an implied private right of action.  

The battle over science policy and scientific data also rages on in the political arena. Rep. Betty McCollum (D-MN) has introduced H.R. 1232: “Save America’s Science Act” to require Federal agencies to maintain and preserve their data assets — even as other proposed bills make their way through Congress which purport to promote transparency in scientific research but would gut the efforts of the EPA if passed, according to critics.  

Data rescue events continue to be organized across the country, and it remains to be seen how effective they will be in scaling up the extraction and archiving of data.  These efforts have, at the very least, brought visibility to the importance of ensuring public access to the treasure trove of scientific data maintained by the federal government.

Read This If you Love Science

From the White House:

This Friday, asteroid 1998 QE2 will sail serenely past Earth, getting no closer than about 3.6 million miles (5.8 million kilometers). The fly-by is harmless — at its closest, the asteroid will be about 15 times farther from us than the Earth is from the Moon. But we still think it’s a good topic for conversation.

asteriodTomorrow at 2 p.m. EDT the White House is hosting the second in a series of “We the Geeks” a new series of Google+ Hangouts,  to highlight the future of science, technology, and innovation here in the United States. Tomorrow’s hangout will focus on asteroids with Bill Nye the Science Guy, former astronaut Ed Lu, NASA Deputy Administrator Lori Garver, and more.

Tune in to find out how experts are working to find asteroids before they find us—and other potential opportunities these space rocks present.  They will be taking questions. Use #WeTheGeeks to submit them.

Science.gov Celebrates 10 years of Service

From Science.gov website:

What is the gas mileage of your car? How much of your state is water covered? Where can you download software that will give atomic precision to your computer’s clock? These and many other questions can and have been answered by Science.gov – the Federal government’s “one stop” real-time science data website, which is now celebrating its 10th anniversary.SciGov_10th_logo

Science.gov was launched in December 2002 and has broken new ground in interagency collaboration. Senior information managers representing 13 Federal agencies forged a consensus on how national science organizations could improve public access to the nation’s rich and diverse scientific research information and partnered to create the Science.gov Web portal.

“Over the past 10 years Science.gov has grown in content, capability, features and functionality,” said Tina Gheen of the Library of Congress and chair of the Science.gov Alliance. “As a result, we have seen significant user growth. We are very proud of this offering of sound science to the public, and we hope to continue providing this free service for years to come.”

There are now more than 200 million pages in Science.gov, and the annual page views top 34 million, a 45-fold increase from the earliest days. The founding idea was to provide a comprehensive gateway to federal science information for those who might not know exactly where to find it. The interagency effort would raise scientific and technical literacy, serve as a foundation for future discoveries, and foster greater understanding of the public’s return on investment from the government’s science and technology efforts.

The Science.gov website is hosted by the U.S. Department of Energy Office of Scientific and Technical Information (OSTI), within the Office of Science.