Tag Archives: Digital Archives

History Channel Gonna History Channel

In between producing television shows about ice road truckers, swamp people, or whatever else the History Channel airs these days, the famously un-historic channel gained attention for recently claiming that pilots Amelia Earhart and Fred Noonan survived their plane crash in the Marshall Islands and were subsequently captured by the Japanese military. For whatever reason, the History Channel’s social media feeds are playing up a dubious claim that somehow the federal government is actively suppressing the “truth” of Earhart’s story, even though the documents they found to support their theory of Earhart and Noonan’s disappearance came from…a government archive.

Photo Credit: Twitter Feed of author and public historian Gordon Belt.

According to the official website of the National Archives and Records Administration, the agency possesses “approximately 10 billion pages of textual records; 12 million maps, charts, and architectural and engineering drawings; 25 million still photographs and graphics; 24 million aerial photographs; 300,000 reels of motion picture film; 400,000 video and sound recordings; and 133 terabytes of electronic data.” It should not be surprising that some of these documents get placed in storage and are sometimes forgotten about by researchers (or they simply don’t know the documents exist). That is not the same as saying the National Archives is deliberately withholding an unclassified document from researchers in the interest of hiding the government’s “secrets.”

By now I should realize that it’s all about the ratings when it comes to the History Channel. Support your local archivist and thank them for preserving history!


UPDATE: There’s a good chance the History Channel’s claims about Earhart are untrue. The power of history blogging!

Promises and Perils of Online Archives

The popular biographer Walter Issacson recently penned an op-ed for the Wall Street Journal in which he muses on “what could be lost as Einstein’s papers go online.” The essay was sparked by the recent digital publication of the first thirteen volumes of Einstein’s papers by a consortium of institutions that includes Princeton, Caltech, and Hebrew University. Issacson uses this development to explore the nature of online archives more broadly, weighing the potential benefits and consequences of opening primary source documents to what he describes as “the wisdom of the crowd.” There is a tension underlying these thoughts, and as the essay title suggests, Issacson seems fairly preoccupied with what could be lost as the archives go online:

My initial joy about the project was tempered, however, by a pinch of sadness. I realized that most future Einstein researchers would no longer have to make the journey to the cozy house on the edge of the Caltech campus where the scholars of the Einstein Papers Project were eager to embrace their rare visitors and ply them with guidance, insights and tea. They wouldn’t likely spend delightful days there—as I did for my biography of Einstein—with the science historian Diana Kormos-Buchwald and her colleagues as they debated such issues as how to explain what Einstein meant when he referred to quanta as “spatial” or his fellow Jews as Stammesgenossen (tribal comrades).

The next generation of scholars will also lose the tingling inspiration of seeing original documents. I remember how much closer I felt to Benjamin Franklin —suddenly, he seemed like a real person—when, at his archives in Yale’s Sterling Library, I first touched a letter that he had written, marveling that this piece of paper had actually once been in his hands. I even made a pilgrimage to the Hebrew University of Jerusalem, which Einstein helped to found and where most of his original documents reside, so that I could draw inspiration. What sublime experiences will researchers miss if they simply view the documents online? What will be lost if the archives, with their passionate staffs, morph into unvisited repositories?

Issacson, however, does express some excitement about the power of computing to help us ask new questions about these documents:

My brooding soon gave way to marveling about the benefits that will come when millions of curious people, with new technologies in hand, get to dive into the papers of historical figures. While I was doing research years ago for my biography of Franklin, the Packard Humanities Institute in Los Altos, Calif., was at work on a digital collection of his papers. After a lot of begging, I wheedled a beta version of the CD-ROMs. They let me search all of Franklin’s papers for specific concepts . . . with the new digital version of Einstein, I have been looking at the 237 times he talked about Palestine—and imagining what a smart researcher could do by tracing the evolution of the 6,720 times he used the phrase “light quanta.”

With online archives, research can be crowdsourced. Students from Bangalore to Baton Rouge can drill down into Einstein’s papers and ferret out gems and connections that professional researchers may have missed. That will reinforce a basic truth about the digital age: By empowering everyone to get information unfiltered, it diminishes the role of gatekeepers and intermediaries. Scholars and experts will still play an important role in historical analysis, but their interpretations will be challenged and supplemented by the wisdom of crowds.

I share Issacson’s enthusiasm for the experience of traveling to and conducting research at archival institutions. I’ve conducted research at many different institutions, but I will always fondly remember my own experiences at the Indiana State Library when I lived in Indianapolis. Although I didn’t know it when I first moved to Indy, it turned out that my house was within walking distance to ISL. There were many Saturdays filled with early morning research, lovely lunchtime walks around downtown, and more research in the afternoon. The detective work of research is fun in and of itself, but the adventure of traveling to a new place and soaking in the character of the surrounding area makes the archival experience sublime.

All of this said, however, I don’t find myself as pessimistic as Issacson about this experiential loss with the move to online archives. Doing research online is, of course, also an experience. Digging into archival resources like Google Books, HathiTrust, The Internet Archive, and Chronicling America requires the same sort of detective work and interpretive skills that one uses at a brick-and-mortar institution. And it’s hard to describe the jubilation you feel upon discovering a crucial primary source that you would have never found or had access to at your local archival institution. Likewise, while the tactile experience of holding a real document in your hands is very, very special, the best web designers and archivists can make digital primary sources equally (if not more) accessible to researchers by providing clear scans, zoom in/out functionality, and text transcriptions that make these documents more approachable and understandable (especially for students in a k-12 setting who may be unable to visit an archive in person).

It’s also important to keep Issacson’s thoughts on digitization in context. All of the digital primary source collections he mentions are from noteworthy great white men in U.S. history: George Washington, Thomas Jefferson, James Madison, Mark Twain, Thomas Edison, and Einstein. Historians and archivists pick and choose what history gets digitized, and it remains an open question as to what should be digitized for online publication and whether or not this effort to publish documents related to White Anglo-Saxon Protestant Males over ones connected to women and minorities merely duplicates the same dominant practices in the history book publishing industry since the nineteenth century. There are literally billions of primary source documents that could be digitized, but the lack of time, cost, and labor to digitize will prevent a sizable number of documents from going online in the foreseeable future. For every Smithsonian undertaking a “digitization strategic plan” there are probably hundreds of archival institutions that lack the ability to digitize anything in their collections. In sum, researchers understand that producing good scholarship means still going to the archives and digging through the actual sources – it can’t all be done online.

The real loss with online archives, as I see it, is the loss of interaction with all of the talented and helpful archivists who help researchers accomplish their goals. I suspect that most researchers don’t have tea with their archivists or bump into world-renowned historians of science at the archives like Issacson does, but almost all can recall an instance in which an archivist pointed them towards collection material they were unaware of, helped transcribe a document that seemed unreadable, or took the time to go into the back corner of a dark room to find requested documents. I can recall many such moments, and my own research over the years wouldn’t have been completed without the help of archivists. They are important people, and I think it’s safe to say that we’ll still need their services and expertise well into the future, whether online or offline.


The Internet as an Archive of 21st Century History

The Stream

“The Stream”

Several days ago I read a fine piece in The Atlantic from anthropologist Alexis C. Madrigal on real-time internet content/information delivery, what Madrigal refers to as “The Stream.” Whether it be Facebook, Twitter, Google Reader (R.I.P.), or the New York Times, many websites have turned to the stream as a means for instantly delivering information that is ostensibly meaningful to readers. The screenshot above is from the “Times Wire”–which is run by the New York Times–and it exemplifies the machinations of the stream: instant updates, individualized content, and and a sense of inclusion, by which I mean a feeling that you are keeping up with and understanding (at least somewhat) what’s going on in the world.

Madrigal explains the stream as such:

The Stream represents the triumph of reverse-chronology, where importance—above-the-foldness—is based exclusively on nowness. There are great reasons for why The Stream triumphed. In a world of infinite variety, it’s difficult to categorize or even find, especially before a thing has been linked. So time, newness, began to stand in for many other things. And now the Internet’s media landscape is like a never-ending store, where everything is free. No matter how hard you sprint for the horizon, it keeps receding. There is always something more.  Nowness also transmits this sense of presence, of other people, that you get in a city when you go to a highway overpass and look down at all the cars at any time of the day or night.

Given my recent embrace of Twitter and my belief in its enormous potential to deliver information to me that I find important, I am now more than ever a product of the stream. Rather than reading a newspaper, I now check my Twitter stream in the morning to see what’s happening, to find information that “newsworthy” to me. When I find content personally interesting, I contribute my own small part to the stream through tweets, Facebook posts, and essays on Exploring the Past. Since I started this regiment of blogging and tweeting one year ago, I’ve been blown away by the connections I’ve made with people all over the world and the number of visits I’ve had to this blog (more than 10,000 so far).

Yet there are times when I feel as if the stream overwhelms me. Sometimes I feel like I can’t get away. I try to work on projects, school assignments, etc., but the pull of nowness sucks me in, challenging me to stop work to check and see if I’m missing something important in the stream. Equally frustrating, these streams make little distinction between what Robin Sloan refers to as “flow” and “stock.” “Flow” refers to information designed for the here and now: updates and tweets about weather, daily activities, your pumpkin spice latte, etc. “Stock” refers to content that I’d argue is more than information in that it actually contributes to knowledge construction; material that you’d still refer to long after its incorporation into the stream.

Madrigal’s article raised larger questions within me about how we view the internet from a holistic viewpoint. If we rely on the stream for obtaining information, how do we promote and preserve meaningful flow and stock content for the long term? Can we break away from the pull of the now to make room for reflection on what has already occurred in recent memory?

Part of the solution, I think, is understanding that while the internet provides us meaningful information for the here and now, the internet should also be viewed as a historical, archived space. Sure, there are sites like the Internet Archive, Google Books, HathiTrust, and Chroncling America that provide public access to historical events, documents, and artifacts from the twentieth century and earlier, but how do we go about archiving the history we make every day through our interactions on the stream? Twitter, Facebook, Reddit, and other related sites are not just sources for nowness: they’re also tools and resources for future historians looking to interpret the history of the early twenty-first century.

Viewing the internet as a historical archive will require more discussion and questioning, as far too many website proprietors view the content and interactions on their websites as disposable rather than historical. Ian Milligan points out that major websites such as Yahoo! and MySpace have recently destroyed millions upon millions of historical digital records, embracing the notion of “who needs old stuff when the future is here?” In the case of MySpace, bloggers who used the world’s largest social media website from 2005-2008 to share their thoughts had their information wiped out instantly in June of this year. As Milligan argues, MySpace “meant something to multiple millions of people,” and future historians are now more impoverished thanks to this focus on the now.

How do you go about preserving your digital records? What would you do if Facebook, Twitter, or WordPress suddenly deleted all of your content, all of your flow and stock?