This is the source code behind our project to collect political ads on Facebook. Built versions are available for Firefox and Chrome. You can browse the American ads we've collected at ProPublica, and the Australian ads over on the Guardian's website.
We're asking our readers to use this extension when they are browsing Facebook. While they are on Facebook a background script runs to collect ads they see. The extension shows those ads to users and asks them to decide whether or not a particular ad is political. Serverside, we use those ratings to train a naive bayes classifier that then automatically rates the other ads we've collected. The extension also asks the server for the most recent ads that the classifier thinks are political so that users can see political ads they haven't seen. We're careful to protect our user's privacy by not sending identifying information to our backend server.
We're open sourcing this project because we'd love your help. Collecting these ads is challenging, and the more eyes on the problem the better.
You can find instructions on how to set up your own full-fledged version of the Facebook Political Ad Collector in INSTALLATION.md
There is an explanation of all the moving parts in ARCHITECTURE.md
In general, the project needs more tests. We've written a couple of tests for parsing the Facebook timeline in the extension directory, and a few for the tricky bits in the server, but any help here would be great!
Also, the rust backend needs a bit of love and care, and there is a bit of a mess in
backend/server/src/server.rs that could use cleaning up.
utm_content, etc., since those sometimes include useful metadata about the ad.)