Articles

Projects walkthroughs, tool teardowns, interviews, and more.

  1. Bots with Thoughts

    By Jacob Harris

    Posted on

    Jacob Harris on magic, aesthetics, and the newsbot frontier.

  2. A Botmaking Primer

    By Joseph Kokenge

    Posted on

    Not sure where to begin with this whole bot thing? Joseph Kokenge is here to help you get started with botmaking 101.

  3. Quakebots and Pageview Quotas: Bot or Be Botted?

    By Matt Waite

    Posted on

    Matt Waite on Daft Punk, algorithmic news, hamster wheels, and journalism’s Rushkoff moment.

  4. A Bot to Find the Source of Serendipity

    By Jacqui Maher

    Posted on

    Just before Thanksgiving last year, a new novelty Twitter account gained notice in our newsroom. @NYTMinusContext, promising “All Tweets Verbatim From New York Times Content. Not Affiliated with New York Times.” tweeted fragments from Times articles that you might not think twice about while reading in article format. Isolated, though, these phrases can be absurd, surprising, and delightful.

  5. An Open Source Bot Factory

    By Albert Sun

    Posted on

    Albert Sun from Interactive News team at the New York Times tells how they use Huginn, a Ruby on Rails project, to create automated agents and scheduled tasks.

  6. How We Made @NailbiterBot

    By Noah Veltman

    Posted on

    The first full round of March Madness is Christmas morning for college basketball fans: 2 days, 32 games, lots of upsets and late-game drama. Last week, on the first full day of the tournament, WNYC transportation reporter Jim O’Grady casually mentioned that he couldn’t keep tabs on all the action during the day. He wished he could get a text message whenever a game was coming down to the wire so he would know when to neglect his professional responsibilities and tune in for the end. I started kicking around the idea in my head a little, and after work my colleague Jenny Ye and I decided to take a break from writing weird JavaScript to write some more weird JavaScript. The result was @NailbiterBot, a humble Twitter bot that posts a tweet whenever an NCAA tournament game is close late in the second half.

  7. How to Break News While You Sleep

    By Ken Schwencke

    Posted on

    Around 6:25 a.m. I was awakened by a jolt from slipping tectonic plates. The tremor didn’t last very long, and as soon as my window stopped rattling my first thought was to check for an email.

  8. Welcome to Bot Week

    By Erin Kissane

    Posted on

    Automated news-gathering tools aren’t new, but they’re multiplying like crazy and getting quite a bit of attention. Little bots have also turned into interesting remixing devices and distribution channels, especially on Twitter. This week on Source, we’re going 100% bot.

  9. Newsroom Analytics: A Primer

    By Jacqui Maher

    Posted on

    Jacqui Maher says it’s not just the numbers, it’s what they mean about the audience.

  10. Inside the Predict-o-Tron

    By Jay Boice and Aaron Bycoffe

    Posted on

    The HuffPost Predict-o-Tron is a tool we built to let people make their own March Madness bracket predictions using basketball statistics, expert ratings, and results from the past four tournaments. There are some interesting tidbits to be found in the data, although they all need to be qualified with the understanding that model performance is based on only four years of data, which leaves us at risk of overfitting. This means that slider combinations that appear to do very well for the past four years may not continue to perform as well if expanded to the past 10 years. With that said, it looks like the experts are very good at picking a bracket, taller teams tend to do better than shorter teams, younger teams do better than older teams, and teams with more depth (both in scoring and playing time) do better than teams with less depth.

  11. How We Made the SOTU Twitter Visualization

    By Nicolas Belmonte and Simon Rogers

    Posted on

    People tweet what they think, when they think it—and, crucially, we wanted to provide a visualization for the State of the Union speech which reflected that. This wouldn’t be a (shudder) word cloud based on frequencies but a way to track the conversation on Twitter as it was directly influenced by the President’s speech.

  12. Event Roundup, Mar 17

    By Erika Owens

    Posted on

    Hacks/Hackers are meeting up in NYC, London, and Berlin this week. Plus, deadline for submissions for the Knight News Challenge to answer: “How can we strengthen the Internet for free expression and innovation?”

  13. You Got the Documents. Now What?

    By Jonathan Stray

    Posted on

    Jonathan Stray’s guide to turning documents into data you can run with.

  14. How We Are Exploring Mountains of Linked Data at BBC News Labs

    By Basile Simon

    Posted on

    I was asked to join BBC News Labs a couple a weeks ago to work on a project that, when it was first briefly explained to me by email, left me clueless about what it was about. (Imagine the discomfort before my job interview with Matt Shearer, Innovation Manager at the Lab.) The project is called #newsVane—and yes, we refer to it with the hash sign every time, don’t ask me why.

  15. Introducing Streamtools: A Graphical Tool for Working with Streams of Data

    By Mike Dewar

    Posted on

    We see a moment coming when the collection of endless streams of data is commonplace. As this transition accelerates it is becoming increasingly apparent that our existing toolset for dealing with streams of data is lacking. Over the last 20 years we have invested heavily in tools that deal with tabulated data, from Excel, MySQL, and MATLAB to Hadoop, R, and Python+Numpy. These tools, when faced with a stream of never-ending data, fall short and diminish our creative potential. In response to this shortfall we have created streamtools—a new, open source project by the New York Times R&D Lab which provides a general purpose, graphical tool for dealing with streams of data. It offers a vocabulary of operations that can be connected together to create live data processing systems without the need for programming or complicated infrastructure. These systems are assembled using a visual interface that affords both immediate understanding and live manipulation of the system.

  16. Event Roundup, Mar 10

    By Erika Owens

    Posted on

    Talking open data this weekend in Montevideo, plus lots of Hacks/Hackers events around the world this month.

  17. Developing Data Journalists in the Developing World

    By Eva Constantaras

    Posted on

    Eva Constantaras on training data journalists where data journalism isn’t a standard practice.

  18. Introducing Source Guides

    By Erin Kissane

    Posted on

    In the year-and-a-bit we’ve been publishing Source, we’ve built up a solid archive of project walkthroughs, introductions to new tools and libraries, and case studies. They’re all tagged and searchable, but as with most archives presented primarily in reverse-chron order, pieces tend to attract less attention once they fall off the first page of a given section. We’ve also been keeping an eye out for ways of inviting in readers who haven’t been following along since we started Source, and who may be a little newer to journalism code—either to the “code” or the “journalism” part.

  19. Human-Assisted Reporting Gets the Story

    By Tyler Dukes

    Posted on

    Tyler Dukes on combining the power of data-sorting tools with old-fashioned digging.

  20. A Map That Wasn’t a Map

    By Tasneem Raja

    Posted on

    If you want to show information with a geographical component, you should start with a map, right? Not so fast, writes Tasneem Raja. Questioning your assumptions can help you make something much more effective.

Current page