September 8, 2014 by Michael Sauers

What can we learn from 800,000 public comments on the FCC’s net neutrality plan?

On Aug. 5, the Federal Communications Commission announced the bulk release of the comments from its largest-ever public comment collection. We’ve spent the last three weeks cleaning and preparing the data and applying our experience in machine learning and natural language processing to try to make sense of the hundreds of thousands of comments in the docket. Here is a high-level overview, along with our cleaned version of the full corpus, which is available for download in the hope of making further research easier.

Our first exploration uses natural language processing techniques to identify topical keywords within comments and use those keywords to group comments together. We analyzed a corpus of 800,959 comments. Some key findings:

We estimate that less than 1 percent of comments were clearly opposed to net neutrality¹.

At least 60 percent of comments submitted were form letters written by organized campaigns (484,692 comments); while these make up the majority of comments, this is actually a lower percentage than is common for high-volume regulatory dockets.

At least 200 comments came from law firms, on behalf of themselves or their clients.
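To make the keyword-grouping idea concrete, here is a minimal Python sketch. It is not the Sunlight Foundation's actual pipeline, which this excerpt does not describe in detail. It assumes a simple TF-IDF score for picking each comment's top keywords and then groups comments that share exactly the same keyword set. The stopword list, the function names, and the parameter `k` are all illustrative assumptions.

```python
import math
import re
from collections import Counter, defaultdict

# Tiny illustrative stopword list (an assumption, not taken from the article).
STOPWORDS = {"the", "a", "an", "to", "of", "and", "is", "for", "in", "on",
             "i", "my", "our", "not", "should", "be", "all", "please"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]


def top_keywords(comments, k=2):
    """Return, for each comment, a frozenset of its k highest TF-IDF tokens."""
    docs = [tokenize(c) for c in comments]
    n = len(docs)

    # Document frequency: in how many comments does each token appear?
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    keyword_sets = []
    for tokens in docs:
        tf = Counter(tokens)
        # Smoothed TF-IDF; ties are broken alphabetically so output is stable.
        ranked = sorted(
            tf,
            key=lambda w: (-tf[w] * math.log((1 + n) / (1 + df[w])), w),
        )
        keyword_sets.append(frozenset(ranked[:k]))
    return keyword_sets


def group_by_keywords(comments, k=2):
    """Group comment indices by their shared top-keyword set."""
    groups = defaultdict(list)
    for idx, keys in enumerate(top_keywords(comments, k)):
        groups[keys].append(idx)
    return dict(groups)


comments = [
    "Protect net neutrality and reclassify broadband under Title II.",
    "Protect net neutrality and reclassify broadband under Title II.",
    "Paid prioritization would hurt small startups like mine.",
]
groups = group_by_keywords(comments)
```

In this toy input, the two identical form-letter comments land in one group and the hand-written comment lands in its own. Exact matching on keyword sets is a deliberately crude stand-in for real clustering: a corpus of 800,959 comments would need something more tolerant of small wording changes.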

Below is an interactive visualization that lets you explore these groupings and view individual comments within the groups.

Read the full article @ The Sunlight Foundation.
