Ruby web scraping tutorial on morph.io – Part 1, setting up your scraper

This post is part of a series of posts that provide step-by-step instructions on how to write a simple web scraper using Ruby on morph.io. If you find any problems, let us know in the comments so we can improve these tutorials.


 

With just a few lines of code, you can write a scraper to collect data from messy web pages and save it in a structured format you can work with.

This tutorial will take you through the process of writing a simple scraper. This tutorial uses the Ruby programming language, but you can apply the steps and techniques to any language available on morph.io.

Over this tutorial you will:

  • create a scraper on morph.io
  • clone it using git to work with on your local machine
  • make sure you have the necessary dependencies installed
  • write scraping code to collect information from a website
  • publish and run your scraper on morph.io

In this first instalment you’ll create a scraper, clone it to your machine, and install

You’ll use morph.io, the command line and a code editor on your local machine.

Let’s get started.

Find the data you want to scrape

In this tutorial you’re going to write a simple scraper to collect information about the elected members of Australia’s Federal Parliament. For each member let’s capture their title, electorate, party, and the url for their individual page on the Parliament’s website.

The data you want to scrape needs to be available on the web. We’ve copied a basic list of members from the Parliament’s website to https://morph.io/documentation/examples/australian_members_of_parliament for practice scraping. You will target this page to get the member information with your scraper.

senators_and_members

The simplified list of Australian MPs for you to scrape on morph.io

Some web pages are much harder to scrape than others. The member information you’re trying to collect is published in a simple HTML list, which means you should be able to target and collect the information you want quite easily. If the information was in an image or PDF then it would be much harder to access programmatically and therefore much harder to write a scraper for.

Now that you’ve found the data you want to scrape and you know you can scrape it, the next step is to set up your scraper.

Create your scraper on morph.io and clone it to your machine

The easiest way to get started is to create a new scraper on morph.io.

Select the language you want to write your scraper in. This tutorial uses Ruby, so let’s go with that.

new_scraper

Fill out the new scraper form

If you are a member of organisations on GitHub, you can set the owner of your scraper to be either your own account or one of your organisations.

Choose a name and description for your scraper. Use keywords that will help you and others find this scraper on morph.io in the future. Let’s call this scraper “tutorial_members_of_australian_parliament” and describe it as “Collects members of Australia’s Federal Parliament (tutorial)”.

Click “Create Scraper”!

After morph.io has finished creating the new scraper you are taken to your fresh scraper page. You want to clone all the template scraper code morph.io provides to your local machine so you can work with it there.

On the scraper page there is a heading “Scraper code”, with a button to copy the “git clone URL”. This is the link to the GitHub repository of your scraper’s code. Click the button to copy the link to your clipboard.

clone_repository

Commands you’ll need to enter to clone your repository

Open your computer’s command line and cd to the directory you want to work in. Type git clone then paste in the url you copied to get something like:

git clone https://github.com/username/tutorial_members_of_australian_parliament.git

This command pulls down the code from GitHub and adds it to a new directory called nsw_parliament_current_session_bills. Change to that directory with cd tutorial_members_of_australian_parliament and then list the files with ls -al. You should see a bunch of files including:

  • scraper.rb, the file that morph.io runs and that you’ll write your scraping code in
  • Gemfile, which defines the dependencies you’ll need to run your scraper.

Now that you have the template scraper on your local machine, you need to make sure you have the necessary software installed to run it.

Installing Ruby

Installing Ruby is out of the scope of this tutorial but there are lots of good guides on the web. You might like to use something like RailsInstaller that takes care of this for you. Tools like rbenv or rvm can also be helpful for installing and switching Ruby versions on your computer.

Install the required libraries

In the Gemfile, you’ll see a Ruby version and two libraries specified:

ruby "2.0.0"

gem "scraperwiki", git: "https://github.com/openaustralia/scraperwiki-ruby.git", branch: "morph_defaults"
gem "mechanize"

This is template code that helps you get started by defining some basic dependencies for your scraper. You can read more about language versions and libraries in the morph.io documentation.

You can use Bundler to manage a Ruby project’s dependencies. Run, bundle install on the command line to check the Gemfile and install any libraries (called gems in Ruby) that are required.

So far you’ve set up all your files, cloned them to your machine, and installed the necessary dependencies. In our next post it’ll be time to write your scraper!

Posted in Morph | Tagged , , | Leave a comment

Matthew and the Digital Transformation Office

The Digital Transformation Office was established in July of this year, by the then Minister, now Prime Minister Malcolm Turnbull, to transform government and make government services simpler, clearer, faster and more humane.

The cornerstone of this huge endeavour is putting users first. Design and build government services for people, not for government and all kinds of amazing things will happen.

This is a mission that is close to my heart. After all Kat Szuminska and I started the OpenAustralia Foundation with the goal of getting people more actively involved in the political process. This led us to create services with people at the centre.

The creation of the Digital Transformation Office brings a once in a generation opportunity to make things better and I want to help realise this amazing opportunity.

That is why I’m joining the Digital Transformation Office.

In order for the OpenAustralia Foundation to clearly and definitively maintain its independence I will step down as a director of the foundation.

This is certainly not a decision I take lightly. On a personal level I would have loved to stay on as a member of the board but with a much less active day-to-day involvement with the running of the foundation and the development of its projects. However, that would not have been the best thing for the foundation. In very practical terms the foundation needs to be able to praise government when it is doing the right thing and criticise government when it is doing the wrong thing. This is only possible with true independence.

I won’t be disappearing. In fact I still plan to contribute as a volunteer.

The foundation is People.

I want to thank Henare Degan for his unflailing “getting shit done” attitude and approach – nothing is too hard and little is too serious to not laugh about.

I want to thank Kat Szuminska for inciting me at just the right time with some carefully chosen words. She’s the true radical, yet ever-patient, with a keen bullshit detector and always the one to ask the questions that get to the core of a problem.

And to all the volunteers and donors over the years, there are far too many of you to mention here. Thank you for everything. Thank you for your support. Thank you for everything you’ve done.

For me everything started with a talk in 2004 given by Tom Steinberg, Tom Loosemore and Stefan Magdalinski. It was the launch of a new website TheyWorkForYou.com. It was really an accident that I was there. Little did I know where it would lead. Thank you to Tom, Tom and Stef for that.

For inventing what we now know as civic tech I want to thank Julian Todd, Francis Irving, Chris Lightfoot, Matthew Somerville and Tom Steinberg. You have all been a continuing source of inspiration to me. Thank you.

I’m very proud of what we’ve achieved at the OpenAustralia Foundation. We’ve helped millions of Australians connect with their communities, governments and politicians. We’ve made tools to help people create the change they want to see.

With Henare, Kat and Luke, the OpenAustralia Foundation is in excellent hands and I very much look forward to seeing it develop and grow into the future. I can’t wait to see what they will make next!

The foundation, not being part of government, and with full independence is still in a unique position to do things that no one else can do. What we need for Australia is a rich and diverse ecosystem of governmental organisations and non-governmental organisations all working together for the best possible outcome for citizens.

This is one of many reasons why what the OpenAustralia Foundation does is now more important than ever.

Posted in Announcement, OpenAustralia Foundation | Tagged , , | 4 Responses

Civic Tech Monthly, September 2015

Welcome to the eighth edition of Civic Tech Monthly. Below you’ll find news and notes about civic tech from Australia and around the world.

As always we’d love to see you at the OpenAustralia Foundation Sydney Pub Meet
next Tuesday if you’re in town. Come along and give a lightning talk about something interesting in civic tech you’ve seen or done.

If you know someone who’d like this newsletter, pass it on: http://eepurl.com/bcE0DX.

News and Notes

A massive week for They Vote For You

Australia got a new Prime Minister last week. As the media detailed the step by step drama, tens of thousands of people visited They Vote For You to see how the players had voted in Parliament.

Over 30,000 people have visited They Vote For You since the leadership challenge was announced. This is over three times as many people as on the project’s launch day when it featured prominently in the Guardian Australia. In a week of so many words, it was fantastic to see people looking for some real information about what MPs do in Parliament.

Web scraping workshop success—we’re doing it again in October

We ran our first Introduction to Web Scraping Workshop a few weeks ago and it was a big success.

You can read more about how it went on information graphics designer Kelly Tall’s blog. We also blogged about the things we learned in this first experiment. We’re looking forward to seeing what everyone does with their new scraping skills!

We were really pleased and impressed with how much everyone learned—so we’re doing it all again. We’re still locking down our date and venue, but it will almost certainly be on Sunday, 25th October, near Central in Sydney. There will be 10 places available.

If you want to learn how to scrape structured data for your projects then let us know you’re interested via email or twitter. Then we can contact you when registrations open shortly.

EveryPolitician – 200 countries and counting

For Global Legislative Openness Week 2015 EveryPolitician set the ambitious target to add 66 more nations to their project and pass the 200 mark—and they did it! People from all around the world contributed research and scrapers to hit the goal.

EveryPolitician is an open, free repository of information on politicians from (now) over 200 national parliaments that you can use in your projects.

FOI and eyes wide shut: even public servants want to know

Suelette Dreyfus made a clear case for a more open, reliable Freedom of Information system in Australia to a conference hall full of public servants in August. Read a transcript of the address in The Mandarin.

Dreyfus argues that a more open government isn’t only in the public interest, but can also protect the independence of public servants and foster public trust in government.

Freedominfo.org is creating a deliberative process exemption library

In Freedom of Information law deliberative process exemptions are rules designed to protect the open and frank flow of advice and discussion inside government by restricting it from public access.

Freedominfo.org is creating a library of model examples of these rules and summaries of national laws that you can compare. How do your government’s rules compare to others? What would you change?

If you’re government isn’t listed yet you can contribute it to the resource.

Economist Joseph Stiglitz discusses the justification for these kinds of public access exemptions and their impact in his 1999 lecture on the importance of transparency to good governance (PDF).

Leadership transition at Open North

Canadian civic tech organisation Open North have a new executive director Jean-Noé Landry as founder James McKinney moves on to new things.

We’re looking forward to seeing how Open North continue to grow with a new director and can’t wait to see what James does next.

In this post James shares his current thoughts about where he fit as an individual inside the organisation he founded and looks back on his approach as executive director.

FoxScan

Since foxes were introduced into Australia in 1871 they have caused huge damage to native species. Foxes are a designated pest throughout the country.

FoxScan gives information to people trying to fix this problem. You report sightings of foxes and the damage they cause and Foxscan—rather than just sucking all this data into a black hole—tries to make it available to everyone. It shares some similarities with GrowStuff in that it encourages people to share disparate information and helps practitioners working independently to contribute to a social good.

morph.io got a big new server and continues to grow

A couple of months ago we celebrated the 3000th scraper running on morph.io. In the last week we passed 3600 scrapers and there’s now 3000 people using the platform—yikes!

With all this new use morph.io’s server was beginning to struggle. Some scrapers were also requiring more memory than we could allocate them.

After a helpful discussion with some of the most active morph.io users we upgraded to a bigger, more powerful server. You can now scrape bigger documents and run more memory intensive processes, as we increased the memory allowance for your scraper runs from 100 MB to 512 MB.

If morph.io is useful to you, please become a supporter to keep it running and open to all.

Posted in Civic Tech Monthly | Tagged , , , , , , , , , , , , , , | Leave a comment

A massive week for They Vote For You

Forget what politicians say. What truly matters is what they do. And what they do is vote, to write our laws which affect us all.
They Vote For You

Australia got a new Prime Minister last week. As the media detailed the step by step drama, tens of thousands of people visited and shared They Vote For You to see how the players had actually voted in Parliament.

Well over 30,000 people have visited They Vote For You since the leadership challenge was announced. This is over three times as many people as on the project’s launch day when it featured in the Guardian Australia’s comment section and Datablog.

Most people looked at the new Prime Minister Malcolm Turnbull’s voting record, while others looked up their own representative or the newly promoted ministers when they were announced. Dozens more people have subscribed to be notified when policies that interest them are updated and we even saw one wonderful person start contributing summaries of divisions. The vast majority of people came from Facebook (70%!), and over 90% came from social media more generally.

In a week of so many words, it was fantastic to see people looking for and sharing real information about what MPs do in Parliament. We hope to see more people holding their representatives accountable for their votes on the laws that effect our society.

Henare went on FBi Radio’s Backchat on Saturday morning to talk about the project and why people are looking for the kind of information it provides.

Posted in Media, They Vote For You | Tagged , , , , , , | Leave a comment

Web scraping workshop success—we’re doing it again in October!

Photo of Henare from OpenAustralia Foundation with the wonderful attendee of the first Introduction to Web Scraping Workshop, September 2015

Henare from OpenAustralia Foundation with the wonderful attendees of the first Introduction to Web Scraping Workshop, September 2015

Earlier this month we ran our first Introduction to Web Scraping Workshop and it was a big success! We had 6 wonderful attendees. By the afternoon everyone had written a scraper to collect data from the web.

You can read more about how it went on information graphics designer Kelly Tall’s blog. We’re looking forward to seeing what everyone does with their new scraping skills!

We owe a huge thanks to the Media, Entertainment, & Arts Alliance for providing a great venue and to Fleetwood Macciatto for a delicious lunch.

We were really pleased and impressed with how much everyone learned—so we’re doing it all again.

If you want to learn how to scrape structured data for your projects then…

The next Introduction to Web Scraping Workshop is Sunday, 25th October, near Central in Sydney.

Register here

There are 10 places available.

If have any questions please email (contact@oaf.org.au) or contact us on twitter (@OpenAustralia).

Things we learned from our first workshop

Pairing works well

At the OpenAustralia Foundation we’ve found Pair programming to be one of the best ways to learn together and transfer knowledge between people.

In planning the workshop we thought pairing would be a good way for people to learn and problem solve, and also that it would be useful to get some experience working in this style.

We paired attendees up to write their scrapers and it was a big success. The pairs had fun working through problems together and were able to remind each other of the techniques they’d been shown when necessary.

We got very positive feedback on this format, so we’ll definitely do it again.

A reference guide would have been useful

We didn’t provide hand-outs with a step by step guide, but rather focused on a detailed, live walk-through of writing a real scraper.

While the live programming was an engaging and realistic intro to scraping, a simple list of methods and techniques could have been a useful aid during and after the workshop. We’ve since had feedback that a resource like this would help avoid easy mistakes after the workshop, such as missing the bundle exec command before running irb or ruby scraper.rb. We’ll prepare something like this for the next event.

Luke drafted a detailed guide to writing a first scraper based on Henare’s demonstration in the workshop—hopefully we can draw on this to create a simpler list of steps and techniques for attendees.

Setup is tricky

Getting your laptop set up for scraping is not straightforward. Only 3 of the 6 attendees managed to get Ruby and the necessary scraping libraries installed for the workshop. This wasn’t a problem on the day because we had already planned to be working in pairs, but it would be better if everyone went home with their laptop ready to run the scraper they had written.

We’re not sure of the best way to handle this problem. We want to keep the workshop focused on scraping rather than get bogged down installing Ruby.

We’re very open to ideas on this. We’ll start by recommending that attendees try a free Ruby on Rails Installfest event that aims to help people get past this often confusing step.

These workshop events are still an experiment for the OpenAustralia Foundation so we’re very open to ideas and keen for feedback.

Remember to get in contact if you want to attend the next workshop in late October as space is limited. You can email us at contact@oaf.org.au.

Posted in Event | Tagged , , , | 3 Responses

Civic Tech Monthly, August 2015

Welcome to the seventh edition of Civic Tech Monthly. Below you’ll find news and notes about civic tech from Australia and around the world.

It seems like August has been a busy month for civic hackers everywhere because this month we’ve got a bunch of new projects from around the world for you.

As always we’d love to see you at the OpenAustralia Foundation Sydney Pub Meet next Tuesday if you’re in town. Last month’s lightning talks and new venue in Surry Hills were a big success. We’re doing it all again this month so come along and tell us something interesting in civic tech you’ve seen or done.

If you know someone who’d like this newsletter, pass it on: http://eepurl.com/bcE0DX.

News and Notes

Introduction to Web Scraping Workshop in Sydney

Join us in Sydney in two weeks time to learn how to write a web scraper. You can use a scraper to quickly grab all kinds of information for analysis and processing. Scrapers are the backbone of OpenAustralia Foundation projects such as They Vote For You and PlanningAlerts—we’re always finding new ways that scrapers can help, and we’re keen to share this skill. You can find out more and register via our blog post.

This is a hands-on, half-day workshop and by the afternoon you will have written a web scraper. Tickets are $295 and the workshop is in Redfern, Sydney on Friday the 4th of September. You will need a laptop and some programming experience to attend but you don’t have to be an expert. If you know what a variable, loop and array are, then this is for you.

This is a new experiment for the OpenAustralia Foundation. We’re trying out a new approach to help people who want to make their own projects. If you can’t make the workshop, but know someone who might be interested, we’d really appreciate if you could pass this along.

Thank you Tom

It was Tom Steinberg’s last day at mySociety earlier this month. He’s made a huge contribution to civic tech everywhere. It’s been our absolute pleasure to work with him over the years and we’d like to especially thank him for being generous with his advice whenever we asked. We can’t wait to see what he gets up to after his chillax and we wish him very well.

We also love this delightfully mad post from the excellent Francis Irving celebrating Tom and all the other amazing people that have contributed to mySociety over the years – “Those brief moments when winning seems possible”.

Soft launch of Freedom of Information Portal for Malaysia

You can now use Malaysia’s first Freedom of Information portal to ask their governments for information. Of course it’s built on Alaveteli!

By law, we have Freedom of Information Enactments (FOIE) in the following two states: Selangor & Penang. However, we have also added the avenue to request for information from Public Authorities not covered by FOI in order to gauge what citizens want to know at the Federal level.

This is a nice example of one of the Civic Patterns we like to follow at OAF: “When designing a service, make your process reflect the legal rules that you wish existed, instead of those that do. Reality will catch up.”

Yo Quiero Saber 2015

This month Argentines got to compare the basic positions of candidates using a simple game. First you have to state your position on an issue, then the you see the positions of the candidates. This dynamic leads you through to see where you stand on a range of issues compared to to all the candidates.

The feedback so far is that people felt informed and used it to pick candidates that better represented their views. Martín Szyszlican, one of the creators, says they’ll keep running and developing the project, and that the big challenge is to reach more people. This time they had over 1% of voters, doubling their audience in 2013, but not enough to impact election results.

Help work out the gender balance of 100 parliaments

Which country has the highest proportion of women in parliament? Do women vote differently on issues like defence, the environment, or maternity benefits? Exactly when did women come into power in different countries, and did their presence change the way the country was run? Frustratingly, these are questions for which it’s difficult to provide an answer, because the objective data just isn’t there … So we created Gender Balance, an easy game that crowd-sources gender data across every parliament in the world.

Believe it or not, but you can have fun, learn about your parliament, and generate useful open data all at the same time. Get to it!

OpenPlanning Launches in Hampshire, UK

Hampshire Hub have teamed up with mySociety to prototype a tool to demystify the local planning process. This is a new take on making local planning records more useful. You can read more in Ben Nickolls blog post on the project.

In excellent open source fashion, this prototype is forked from the OpenAustralia Foundation’s PlanningAlerts, which in turn is based the UK’s original planningalerts.com. As civicpatterns.org says “Don’t Reinvent The Wheel”

It looks like the team is considering how to merge some of their upgrades back into PlanningAlerts.org.au, which would be truly fantastic.

Dare to talk about your civic tech mistakes — submit your failure story

In a small room full of friends, talking about failure should be easy. But for some reason — maybe because of the relative novelty of using more expensive technologies for social innovation — people working around civic technology are not used to admitting when their projects don’t work. And while this sort of dishonesty might help with short-term opportunities (especially when it comes to funding), it has a serious effect on long-term sustainability: We cannot learn from our mistakes.

Tell the Sunlight Foundation about your unsuccessful projects so they can work out the patterns and help us all improve our work.

CKAN meetup and Hacks/Hackers in Sydney

In more Sydney event news, after the scraping workshop we can walk down to the CKAN meetup. There’ll be lots of people with experience working with and publishing open data.

Hacks/Hackers is also back on in Sydney. The next meetup, Data Journalism and Investigations in Political Reporting, is on Wednesday evening, September 16. You should give a lightning talk if you’re in town.

How far does your MP tread the party line?

You can greatly improve or reduce the usability of a resource just by changing its text. This is an interesting post about the impact of wording in a civic tech site and how the TheyWorkForYou team approached improving it.

101 web scraping and research tasks for the data journo (or civic hacker)

If you want to learn scraping or polish your skills (and can’t make it to our workshop ;-) ) then here’s 101 tasks to keep you busy.

Help develop the Influence Mapping Toolbox

You can help the development of new tools to map the role of personal ties and economic interests in politics. If you have an influence mapping project, or are interested in getting one started, you can help the team with their initial research and shape development of their new tools. You’ll also find lots of interesting discussion on the topic of influence mapping at the Influence Mapping Google Group.

Posted in Civic Tech Monthly | Tagged , , , , , , , , , , , , , , , , , , , | Leave a comment

Introduction to Web Scraping Workshop in Sydney

Web scraping is a flexible and powerful technique to collect data for your projects. You can use a scraper to quickly grab all kinds of information for analysis and processing. Scrapers are the backbone of our projects such as They Vote For You and PlanningAlerts— we’re always finding new ways that scrapers can help.

There’s been a lot of interest in our scraping platform morph.io recently, and in web scraping generally. After seeing how quickly people can get productive with scrapers at our two recent events, we’ve been thinking about how we can continue to share these skills. We decided to organise a half-day training course, which we’re announcing today!

Join us for the Introduction to Web Scraping Workshop, Friday September 4th in Sydney.

In this hands on, half-day workshop you’ll get experience creating a scraper with developers from the OpenAustralia Foundation. By the afternoon you’ll have written a scraper to collect data from websites daily.

Tickets are $295 and we’ve limited it to 16 spaces so everyone gets lots of attention. As always, we are a charity so all revenue goes directly back into the development and maintenance of our projects.

Henare Degan and Luke Bacon from the Foundation will be leading the workshop.

Who should attend?

If you’re a data journalist, app developer, researcher, or you just need to quickly collect and track information, then knowing how to scrape data from the web will save you time and open up new opportunities.

PLEASE NOTE: To attend this workshop you’ll need some basic experience with programming, web development and using GitHub. You will need to bring a laptop to write your scraper on.

In the workshop we will be writing scrapers in the Ruby programming language, but you do not need experience with Ruby. Beginner level knowledge in any language is fine (e.g. Javascript, PHP, Python, etc.). You’ll be able to take what you learn and use it with any language.

If you have any questions about the event please contact us.

Thanks to the Media, Arts and Entertainment Alliance for hosting us at their lovely venue in Redfern.

Posted in Event | Tagged , , , , | Leave a comment

Civic Tech Monthly, July 2015

Welcome to the sixth edition of Civic Tech Monthly. Below you’ll find news and notes about civic tech from Australia and around the world.

It’s been very cold here in Sydney. People were even snowboarding down the main street of Katoomba, where our co-founders live.

Maybe that means people have been hibernating because this month we started with slim pickings for the newsletter. Fortunately we’ve managed to pull together some interesting links to share with you nonetheless. Remember, if you’d like to submit your own links you can do so right on the GitHub repository where we prepare this newsletter.

Folks coming to the Sydney Pub Meet on Tuesday are in for a treat this month. Anyone can give a lightning talk to share something interesting in civic tech they’ve seen or done. This means we’ve also got a lovely new venue – the Trinity Bar in Surry Hills. We’ve also got more room so there’s still spots available for you to RSVP and come along.

Know someone who would like these newsletters? Pass on the sign up link http://eepurl.com/bcE0DX.

News and Notes

Citizens vs Customers: Democracy in the age of Google

Check out the video of the Google Tech Talk that Matthew Landauer gave prior to the Google Serve event we mentioned in last month’s edition. You can also read more about what we achieved with a little scraping and the help of some civic-minded Googlers in Luke Bacon’s blog post, “Another 2 million people can get PlanningAlerts“.

Become a supporter of morph.io

Speaking of scraping, this month we added a way for people to support our free and open scraping platform, morph.io. By becoming a supporter you can keep morph.io free for everyone and make it even more awesome. There’s also the option to get priority technical support if you need it. A huge thank you to the wonderful people who have already shown their support. Please give them a hug if you see them.

Unearthing interesting data

There are almost 3,000 scrapers on morph.io which means there’s all sorts of interesting data to play with. Here’s a few that caught our eye this month:

What interesting data have you seen? Tweet us: @morph_io

Hello to mySociety’s new CEO, Mark Cridge

Like so many people we were sad to hear that Tom Steinberg, mySociety’s founder, was stepping down. Now we’re excited to welcome Mark Cridge as their new CEO. We’re looking forward to working with Mark in the future and we hope to meet him soon.

Alaveteli 0.22

While we’re on the topic of mySociety, they’ve just released Alaveteli 0.22. This is the software that runs Right To Know, and so many other Freedom of Information request sites around the world. We wouldn’t normally include something so technical in the newsletter but this is a big release and also includes contributions from our very own Luke Bacon and Henare Degan. Some of these contributions are a direct result of AlaveteliCon 2015, which we’ve mentioned in previous editions.

Flagpost

This site was generating a bit of chatter on Twitter recently. We think it’s a really nice example of what can happen when governments engage in an open process and civic hackers see a way to dive in and make it even better. Flagpost collects and overlays additional useful data about submissions to New Zealand’s national flag redesign process. And of course the data to make this happen comes from a scraper, which itself is an extension of another scraper. That’s some fine civic hacking collaboration.

Posted in Civic Tech Monthly | Tagged , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , | Leave a comment

Another 2 million people can get PlanningAlerts

Volunteers at the GoogleServe event

Volunteers at the GoogleServe event

Another 2 million people in Australia now have a simple way to impact development in their local area. Last month the OpenAustralia Foundation teamed up with a group of volunteer engineers from Google and have expanded the availability of our project PlanningAlerts to 21 more local council areas.

The event was part of the GoogleServe program, a week in June where Google staff work on projects with local not-for-profit organisations around the world. It was also the 7th birthday of OAF’s first project OpenAustralia.org.au.

Volunteers writing scrapers together

The crew writing scrapers together

We spent the 16th and 17th of June at Google’s Sydney office. We mostly focused on expanding the coverage of PlanningAlerts but also collecting useful data and introducing our other projects. We looked at summarising divisions in They Vote For You, collected basic information on local councilors throughout Australia and added new capabilities to morph.io. Matthew Landauer, OAF founder and co-director, also kicked things off with a presentation on our projects and mission titled ‘Citizens vs Customers’.

Matthew Landauer talking with attendees after his presentation

Matthew Landauer talking with attendees after his presentation

We had a great time working with everyone and were really impressed by the volunteers practical approach. Here are some of the outcomes from the event:

  • A huge amount of useful data was made public through morph.io;
  • that data is immediately being used by citizens all over Australia through PlanningAlerts;
  • a bunch of open source developers got to know each other and are continuing to collaborate; and
  • we all got to learn and practice scraping, which increases our capacity to collect data.

As a small charity, these practical outcomes, for citizens, attendees and our organisation, are crucial.

2015-06 OAF Google - 24 of 25

Our scoreboard showing some of the councils we worked on and the population count.

A huge thanks to Tim Ansell from Google for organising the event and to everyone who came along. We were extremely impressed by the volunteers. They dived straight in, asked questions without fear, learned what they needed, and started making practical contributions right away. It was a great example of how these kind of hacking events can give existing open source projects a solid boost. What a birthday present to the OpenAustralia Foundation and what a win for local democracy!

Posted in Event, PlanningAlerts.org.au | Tagged , , , , , , | 3 Responses

Civic Tech Monthly, June 2015

Welcome to the fifth edition of Civic Tech Monthly. Below you’ll find news and notes about civic tech from Australia and around the world.

It was our 7th birthday this month (yay!) so we hope you’ll indulge us a little while we share a little more news about ourselves than usual. That also gives us a great reason to remind you that the entire production of Civic Tech Monthly is done in the open on GitHub. We very much welcome your suggestions throughout the month for items to add to the newsletter.

If you’d like to share the sign up link it’s http://eepurl.com/bcE0DX. Sydneysiders, we hope to see you at the Pub Meet on Tuesday.

News and Notes

Scraping get-togethers galore

For the last couple of months Luke Bacon and Matthew Landauer have been working hard on polishing our scraping platform, morph.io. This has handily coincided with a number of Sydney events to do with scraping, organised by us and others.

If you’re in Sydney tonight, Thursday the 25th of June, head along to Australia Open Data’s event organised by Rosie Williams of InfoAus fame. About a fortnight ago Luke organised a low key event dedicated to scraping that was a great success. We also had a great time scraping with Googlers last week and we’ll share more exciting news about that with you very soon – keep your eyes peeled on the blog.

Helping people open governments around the world – AlaveteliCon 2015

Last month #Alaveteli15 had top billing in this newsletter. This month you can read all about Henare Degan’s experience at the conference in his blog post summing up the two huge days he had at the conference. From sharing war stories of peoples’ experiences using Freedom of Information with Right To Know in Australia, through to hatching international plans to develop a guide to what makes a good FOI law in today’s online world.

s115a.com

Australia’s Parliament recently voted to enact legislation which allows copyright holders (like movie, TV and music producers) to apply to the Federal Court to have piracy-related websites blocked. Will Ockenden created s115a.com (named after the section of the law that was added) in an attempt to introduce some transparency to those site-blocking laws by letting you know when an application is made, and (hopefully) the final result.

18F launches new US openFOIA site

18F is the US Government’s digital services delivery team, similar to the well known and regarded GDS in the UK (that the Australian Government is hoping to repeat with the DTO). They have just launched openFOIA, which guides citizens through the process of making a Freedom of Information request. It’s interesting to see a government-delivered take on this (and heartening that they at least acknowledge this has been done before). It’s being developed in the open on GitHub, like everything else 18F do.

A step forward for open government in NSW and the NT

Anyone can now easily make requests for information from New South Wales and the Northern Territory governments and local councils using Right To Know. We’ve added hundreds of new authorities to the site and made a host of changes to facilitate some of the quirks in the different access to information laws. We’re really pleased to see some authorities already helping citizens access information important to them. You can read more of the nitty-gritty in our blog post announcing the launch.

Civic tech job opportunities

It’s not often these come up so we’re excited to hear about two sets of job opportunities this month. Code for Australia is hiring fellows – they’re looking for developers, designers or community organisers to work at the NSW Department of Family and Community Services and the Victorian Neighbourhood Justice Centre. Applications are also open for the next cohort of Knight-Mozilla OpenNews fellows, they’re after developers, technologists, civic hackers, or data crunchers who want to help change the world of journalism.

The next 3 months are going to be really busy – here is our plan

Since the beginning of this year the core team at the OpenAustralia Foundation has been getting together for a day every quarter to make a plan for the upcoming 3 months. This time Matthew Landauer has written up our plans in a blog post. As you’ll see in the post we’ve got another very busy 3 months ahead!

Posted in Civic Tech Monthly | Tagged , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , | Leave a comment