Alpha release: SoundCite makes inline audio easy and seamless

Over the weekend we released SoundCite, a tool that lets anyone easily include inline audio in their stories. We have open-sourced our code on GitHub and would encourage you to contribute in any way you wish. Fork it, send us pull requests or just let us know what you think.

SoundCite helps users create short audio clips from files on SoundCloud. Users insert those clips into the text of a story with an embed code generated by SoundCite.

The project has been a fun challenge for me personally and about 10 months in the making. Last year my professor, Jeremy Gilbert, offered a challenge to our Advanced Interactive and Design class for our final project. He told us to design the future of news, whatever that may be. Students prototyped interactive touchscreen kitchen tables, news sources that upped the ante on the future of curation and new ways to track events across a community. They were bold, grand ideas for an increasingly connected future of journalism.

I approached the project at a more granular level. I asked myself, “How can we tell our individual stories better in a digital age?” I decided that we needed to develop better ways to semantically attach pertinent media to our words. Storytelling on the web can combine visual and aural media with text in a way no other medium can. Given my love of music and all things audio, I chose to tackle bridging the gap between aural and textual media. The solution I arrived at was SoundCite.

My initial prototype was nothing more than a browser hack, but the core ideas were in place and still hold true with today's product. With SoundCite, a writer can take an audio source hosted on SoundCloud, select a start and end point, and create an audio clip that embeds neatly into text, much like a hyperlink. There is no external audio player, no interface and no distractions. SoundCite is a tool for unifying audio and text to tell your story better.

The prototype eventually made its way to the Knight Lab, and for the past nine months, we've been working hard to develop this idea into a fully capable product that any publisher can use. Our current alpha release is not quite there yet, but we hope you start testing it privately in your storytelling and thinking of ways that SoundCite can work for you as we continue improving towards a beta release. We have plenty of ideas to get you started.

Before I began programming, I fancied myself a music critic. Yet no matter how much I practiced the craft, I found it impossible to translate the sounds I heard into words that any reader would understand. Ironically, writing about the actual music was the hardest part of writing about music. If I could just point the reader to the snippet of music I wanted to discuss, I could write the parts I really wanted to write: the analysis.

SoundCite works perfectly in this referential style. Indeed, its name derives from that point: the sounds you link to text are citations to the source you are referencing. Here's an example I adapted from a Pitchfork review of Tame Impala's 2012 single, “Elephant”:


This was my initial use case for SoundCite, but as we continued to develop the product, we discovered even more compelling use cases. The true power of using audio is the emotional depth it can bring to a story. When broadcast radio was introduced, families gathered around radios in their homes to hear the latest episode of Amos 'n' Andy or The Jack Benny Program. Instead of reading and imagining a character's voice, they could hear it. “Wait, Wait, Don't Tell Me!” exists today because of this power.

SoundCite can be used in this way to add emotional depth and voice to your stories. In the months I have been developing the product, I have come across countless stories where SoundCite would have served a story well, but none made me wish I had worked faster than the Los Angeles Times' feature on delayed 911 calls. Recognizing the power of the voices in these calls, they chose to upload a particular call to SoundCloud and embed the entire 11-minute call at the top of their story so readers could listen. Meanwhile, the writers referenced the call throughout the story, weaving their own narrative. Here's what SoundCite can do to make these two mediums work together:

Sometimes, the natural sound of an environment you are writing about is, much like music, impossible to describe, but to take the reader to that place, you as a writer must try. SoundCite not only solves that problem, but enhances the story to make a truly immersive experience.

This goes beyond idyllic nature sounds for travel stories, though certainly, SoundCite can work for that style. The chaos of the Arab Spring left behind all kinds of multimedia artifacts that made the protests so much more vivid. With SoundCite, those primary source sounds can become tools for immersing readers into the scene. Here’s SoundCite applied to a New York Times story:

These are just a few of the many ways SoundCite can enhance a story. SoundCite can be used for building interactive transcripts during crucial Supreme Court hearings, fact-checking politicians against their own words, capturing a play-by-play call of a buzzer-beating three-point shot and so much more that we haven't even thought of yet.

SoundCite is still an alpha product, and certainly not ready to be publicly used in a story. We have many bugs to squash and features to add, including creating more than one clip at one time and customizing your SoundCite embed.

If you want to try it out, head over to our website and use our clip creator to generate the embed code! The process requires no technical experience beyond copying and pasting some pre-generated code into your post editor, just like a YouTube video.

Again, check out the code on GitHub feel free to contribute in any way you wish! Fork it, send us pull requests or just let us know what you think. We want all the feedback we can get as we build towards a beta release in the coming weeks. Stay tuned.

About the author

Tyler Fisher

Undergraduate Fellow

Latest Posts

  • With the 25th CAR Conference upon us, let’s recall the first oneWhen the Web was young, data journalism pioneers gathered in Raleigh

    For a few days in October 1993, if you were interested in journalism and technology, Raleigh, North Carolina was the place you had to be. The first Computer-Assisted Reporting Conference offered by Investigative Reporters & Editors brought more than 400 journalists to Raleigh for 3½ days of panels, demos and hands-on lessons in how to use computers to find stories in data. That seminal event will be commemorated this week at the 25th CAR Conference, which...

    Continue Reading

  • Prototyping Augmented Reality

    Something that really frustrates me is that, while I’m excited about the potential AR has for storytelling, I don’t feel like I have really great AR experiences that I can point people to. We know that AR is great for taking a selfie with a Pikachu and it’s pretty good at measuring spaces (as long as your room is really well lit and your phone is fully charged) but beyond that, we’re really still figuring...

    Continue Reading

  • Capturing the Soundfield: Recording Ambisonics for VR

    When building experiences in virtual reality we’re confronted with the challenge of mimicking how sounds hit us in the real world from all directions. One useful tool for us to attempt this mimicry is called a soundfield microphone. We tested one of these microphones to explore how audio plays into building immersive experiences for virtual reality. Approaching ambisonics with the soundfield microphone has become popular in development for VR particularly for 360 videos. With it,...

    Continue Reading

  • Prototyping Spatial Audio for Movement Art

    One of Oscillations’ technical goals for this quarter’s Knight Lab Studio class was an exploration of spatial audio. Spatial audio is sound that exists in three dimensions. It is a perfect complement to 360 video, because sound sources can be localized to certain parts of the video. Oscillations is especially interested in using spatial audio to enhance the neuroscientific principles of audiovisual synchrony that they aim to emphasize in their productions. Existing work in spatial......

    Continue Reading

  • Oscillations Audience Engagement Research Findings

    During the Winter 2018 quarter, the Oscillations Knight Lab team was tasked in exploring the question: what constitutes an engaging live movement arts performance for audiences? Oscillations’ Chief Technology Officer, Ilya Fomin, told the team at quarter’s start that the startup aims to create performing arts experiences that are “better than reality.” In response, our team spent the quarter seeking to understand what is reality with qualitative research. Three members of the team interviewed more......

    Continue Reading

  • How to translate live-spoken human words into computer “truth”

    Our Knight Lab team spent three months in Winter 2018 exploring how to combine various technologies to capture, interpret, and fact check live broadcasts from television news stations, using Amazon’s Alexa personal assistant device as a low-friction way to initiate the process. The ultimate goal was to build an Alexa skill that could be its own form of live, automated fact-checking: cross-referencing a statement from a politician or otherwise newsworthy figure against previously fact-checked statements......

    Continue Reading

Storytelling Tools

We build easy-to-use tools that can help you tell better stories.

View More