Travis Swicegood's real world data lessons from Texas Tribune


Travis Swicegood, director of technology at the Texas Tribune, spoke this week at the latest Hacks/Hackers Chicago Meet-up about the challenges of working with public data, or “real world data,” as Swicegood calls it.

There are plenty of challenges in collecting, managing and presenting data from a state the size of Texas: 26 million people, 254 counties, five major cities and a state economy of $1.2 trillion. Swicegood shared a few lessons from wrangling that data:

“Data frequently, frequently, frequently disappears.”

Swicegood described “real world data” as dirty and unpredictable. Lost data, though, is even worse. He keeps a copy of every dataset he works with at the Tribune, a habit he formed after seeing data go missing from local locations as well as government websites. He also distinguished the Texas Tribune’s civic datasets from “big data.” While Tribune datasets may include millions of records, Swicegood noted that the data collections of, say, social media organizations are much larger and constantly growing.
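The habit of keeping a copy of every dataset can be sketched in a few lines of Python. This is only an illustration, not the Tribune's actual workflow: the function name `archive_snapshot`, the file naming scheme and the sample data are all assumptions made for this example. It stores each download under a name that includes the date and a short checksum, so a later change to the source file produces a new copy instead of overwriting the old one.

```python
import hashlib
import tempfile
from datetime import date
from pathlib import Path


def archive_snapshot(data: bytes, name: str, archive_dir: Path, today: date) -> Path:
    """Save a dated, checksummed copy of a downloaded dataset (hypothetical helper)."""
    archive_dir.mkdir(parents=True, exist_ok=True)
    # A short SHA-256 prefix distinguishes versions whose contents differ.
    digest = hashlib.sha256(data).hexdigest()[:12]
    target = archive_dir / f"{today.isoformat()}_{digest}_{name}"
    # Identical content saved on the same day maps to the same file, so skip the rewrite.
    if not target.exists():
        target.write_bytes(data)
    return target


if __name__ == "__main__":
    # Made-up sample bytes standing in for a file fetched from a government site.
    sample = b"county,enrollment\nTravis,1000\n"
    with tempfile.TemporaryDirectory() as tmp:
        saved = archive_snapshot(sample, "schools.csv", Path(tmp), date.today())
        print(saved.name)
```

Putting the checksum in the file name is one design choice among several. A version control system or an object store with versioning would serve the same purpose.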

“Making assumptions about the data is something that can get you in lots of trouble.”

Tribune beat reporters act as domain experts and in-depth “explorers” of the data. The dedicated data folks might be best suited to manipulation and visualization, but a beat reporter can help a developer understand datasets and explain apparent discrepancies. Thanks to their time on the beat, reporters also understand what makes a dataset really interesting.

“This is why more citizens don't grab big government datasets.”

Swicegood recounted the difficulties of collecting inconsistent school data from the 254 Texas counties for the Tribune's Public Schools Explorer. He also mentioned the occasional 700-column-wide CSV file and the reports from 141 agencies that go into the Tribune's famous state employee salary database.

Swicegood also shared his data tools and techniques for working with dirty, inconsistent data. He recommended analysts capture their calculations in scripts, document their data munging and use version control so their operations can be replicated when new data is released.
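As a rough illustration of capturing data munging in a script, the Python sketch below cleans a small, made-up salary export. The column names (`agency`, `salary`), the sample rows and the function `clean_salary_rows` are assumptions for this example and do not come from the Tribune's code. Each cleaning step is written down as a comment, so the same operations can be rerun and reviewed when a new release of the data arrives.

```python
import csv
import io


def clean_salary_rows(raw_csv_text):
    """Normalize a hypothetical agency salary export into consistent rows."""
    reader = csv.DictReader(io.StringIO(raw_csv_text))
    cleaned = []
    for row in reader:
        # Step 1: collapse stray whitespace and normalize capitalization in agency names.
        agency = " ".join(row["agency"].split()).title()
        # Step 2: strip currency symbols and thousands separators from salaries.
        salary_text = row["salary"].replace("$", "").replace(",", "").strip()
        # Step 3: skip rows with no usable salary rather than guessing a value.
        if not salary_text:
            continue
        cleaned.append({"agency": agency, "salary": float(salary_text)})
    return cleaned


# Made-up sample data with the kinds of inconsistencies described above.
SAMPLE = """agency,salary
  department of  transportation ,"$52,000.00"
Texas Education Agency,48000
parks and wildlife,
"""

if __name__ == "__main__":
    for cleaned_row in clean_salary_rows(SAMPLE):
        print(cleaned_row)
```

Kept under version control, a script like this doubles as documentation: the commit history shows when and why a cleaning rule changed.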

“Our CEO likes to say we're The Boy Who Lived.”

The Tribune launched within months of two other non-profit news organizations, Chicago News Cooperative and Bay Citizen. Four years later, the Texas Tribune is the only one still running as originally envisioned: Chicago News Coop shut down in 2012, and Bay Citizen has partnered and rebranded a few times. Swicegood said the Tribune’s business model and focus on Texas (a state with plenty of wealthy donors) have helped it succeed.

“I like to say we're a technology company that produces a journalism-based product.”

The Texas Tribune has always been an online-only publication. Swicegood said he is slowly winning converts to his position, to the chagrin of some in the newsroom.
