The Crowd and the Cosmos: Adventures in the Zooniverse

Page 29

by Lintott, Chris

tried in these pages to convey not only my excitement about the

results of our projects, but that of the people who get caught up

in them. Participation in Zooniverse projects gets science done,

but it’s more than that. A study led by Karen Masters, Galaxy

Zoo’s project scientist (and now the elected spokesperson of the

entire Sloan Digital Sky Survey), which asked questions of volun-

teers while they were classifying, showed that people who par-

ticipate in such projects learn things.

They learn things, in fact, that they couldn’t have learned from

the projects themselves. In other words, taking part in, say,

Galaxy Zoo, inspires people to go out and seek out more infor-

mation about the Universe. The projects act as an engine of

motivation, creating a cohort of people who are actively seeking

information they never knew they wanted. Think of Planet

Hunters finding the details of transiting planets, or Old Weather

participants digging into naval history, or any of the other byways

and distractions we’ve inspired. Gamifying the experience—hid-

ing the science behind a thin veneer of play, making it feel less

like real science and more like any other game on your phone—

might make projects more efficient, but it would kill this most

important side effect of participation stone dead.

Three PaThs 241

You can see this in the studies researchers have done of the

effect of even the simple gamification—the ranks on board the

ship—that we added to the Old Weather project. Volunteers

they interviewed said that the game worked as designed, in that

it made them more likely to do more work, but there was

another, more disturbing result. Instead of describing partici-

pating as fun and interesting, they suddenly used language that

made it seem like work. One volunteer, anonymous in the

study, said that though they made it to captain they found it

stressful trying to stay ahead. So our options in reality two,

where we need to resort to either hiding the task within a game

or to using the kind of manipulation of reward that makes people

feel like they’re burdened with a task, are between projects

which don’t change people’s thoughts about whether they can

participate in science and those that feel like you’ve taken on a

second job. In this reality, science is still reserved for the clever few who capture the attention and time of others through

design, using the resulting effort for their own purposes.

Participants are motivated not by curiosity, but by competi-

tion. It may be effective, but it seems a long way from the best

that we could manage.

I want us to live in a third reality. This one is going to take some

work, I think. It’s a universe in which we don’t need to rely on

advances in machine learning to get the best out of the wealth of

scientific data that we now have access to. It’s one where human

intuition and pattern recognition are still needed to get the most

from data, even when machines are good at classification them-

selves. I feel pretty confident that this is indeed part of the reality we live in; though the recent advances in machine learning have

been breathtaking, I think that our science is weird enough and

our requirements exact enough that there will be a human

element to it for a long while yet.

242 Three PaThs

However, as I’ve repeatedly stressed, our ability to collect

information about the Universe continues to astound. We

shouldn’t expect the pace at which new data flows in to decrease,

nor should we expect it to become less open.* Because I want to

keep communities of volunteers consciously participating in sci-

ence, with no hiding in games, this means accepting that we

won’t be able to rely on citizen scientists to do all the work.

We’ve already opened the door to a solution. The supernova

project showed that when humans and machines classify in con-

cert, the outcome can be better than either working alone. I

reckon that machines, as they improve while the surveys grow,

will take on more of the burden, leaving the volunteers to review

the unusual, the unexpected, and the interesting. The work

comes in deciding how to divide the effort in such a way that

allows the most interesting objects to be found; this probably

means wading through a lot of confusing images or, for some

projects, a lot of junk with little inherent interest. We need to

understand how participants in Zooniverse projects want to

work alongside the robot colleagues that my clever machine-

learning colleagues are building for them.

This isn’t a problem unique to us. In clicking your way around

the material that Facebook chooses to show when you log in,

you are in some sense collaborating with its algorithms. You are

providing information about what the site should do next, which

it responds to by showing you things it thinks you want to see.

* I’m somewhat dismayed that the LSST project has now taken money from international collaborators, including those of us in the UK, to help fund its operating costs in exchange for privileged access to data. I hope the leadership will see sense and, despite the need for cash to keep the lights on and the servers humming, find a way to go back to what was once imagined as the most open of projects, with data freely shared and available to everyone. The sky the telescope will scan, image, and monitor, after all, belongs to no one, and there is certainly plenty of science to go round.

Three PaThs 243

More precisely, it will show you some combination of what it

thinks you want to see, content that is most likely to expand the

time you choose to spend on Facebook and content such as

adverts that is profitable for the platform.

I hope that makes your skin crawl just a little. I think we’re just

beginning to understand how our attention is being manipulated

on the internet, and to work out how to talk about it. I think that

setting up a project like Galaxy Zoo, but with machine-learning

classifiers actively working alongside human ones, is a fascinat-

ing problem which allows us to think about what we want. Even

simple examples pose dilemmas. One worry is that if we allow a

neural network to take the images it is most certain about away

from classifiers then our poor humans might lose the brightest

and most interesting images faster than the faint blobs, which are

harder to classify. If we assume that people are, for all that they

tell us they’re in it for the science, partly interested in the brightest and most interesting galaxies, even if subconsciously the

dopamine hit of suddenly seeing something spectacular keeps

them classifying, then in allowing the machine to remove pre-

cisely these galaxies from circulation we have built a project

which gets progressively less appealing over time.

Yet we have a problem. I can’t just put the bright galaxies back

in the pot without compromising on the promise, implicit in any

citizen science project and explicit on the Zooniverse, that any

work done by someone will actually be used. I
experienced an

early warning that this was going to be difficult when we shut

down our original supernova project. As I said earlier, that deci-

sion was triggered when the researchers involved switched to an

automatic classifier, and told me that they would no longer use

the results of of citizen scientists’ efforts.

On the face of it, an easy decision. The classifications were not

being used, so we no longer had a project worth participating in.

244 Three PaThs

Yet the participants, many of them dedicated people who would

come back whenever new data was released, searching for super-

novae time and time again, were not happy. I ended up calling a

few of them to understand their views, and they were pretty

unanimous in the fact they wanted their work to be used, for sure.

No one I spoke to would participate in the project if we told them

that we would just throw the results away. But they didn’t under-

stand why the researchers would switch to an automated system.

The researchers, I think, saw the automated system as just eas-

ier to understand. Given that a modern convolutional neural net-

work can be essentially a black box, as inscrutable at least as the

average person, I’m not sure that this is justified, but I see why

they might think like that. We didn’t think clearly enough about

what this would feel like to the volunteers. One day they were

contributing classifications that made a real difference scientific-

ally. The next day they weren’t, even though from their perspec-

tive nothing had changed; it’s not as if they suddenly got worse.

What had seemed to be a collaborative project was suddenly

looking rather one-sided.

I think what we got wrong here was the lack of control we gave

the volunteers, suddenly wrenching their project away from

them, and I think that’s the key to how to cope with the com-

plexities of this third reality, when we combine human and

machine classifications. If we give people control over what they

see, they can make their own decisions about how they want

things to run. I really like the idea of a project that says, ‘We know if we give you more beautiful galaxies (or spectacular penguins,

or interesting texts) you’re likely to stay around longer. These

classifications won’t count, but do you want to see these images

anyway? If so, how often?’

That seems honest and interesting, and I hope will lead to a sys-

tem that can cope with the majority of the data heading our way.

Three PaThs 245

If we don’t do something like this, I worry that we’ll miss out on the most unexpected of finds. On the contrary, I think we’re likely—if

we get it right—to be overwhelmed with interesting things.

Imagine a typical night, a few years from now when LSST is

operating. As the Sun sets over the Chilean Andes, the dome

containing the telescope opens up to allow it to cool in the cold

night air. As the sky darkens, the enormous beast of a machine

inside starts to methodically work its way across the sky, never

pausing in one place for very long but often flicking back to

where it has already been, to keep an eye out for asteroids and

other rapidly moving or changing phenomena.

As the telescope and its camera work away, the images it takes

are flowing digitally away from the mountaintop observatory

and out into the world. They will soon end up at the US National

Supercomputing Center in Illinois, where code will compare

each one to previous images of the same field, checking the

brightness of millions of objects. In any given image, millions of

times a night, some object or other will be found to have changed

in brightness, or to have apparently appeared from nowhere—or

vanished completely.

LSST deals with this by issuing an alert, a public declaration of

something happening in the sky. Its massive database, eventually

laden with the fruits of ten years of surveying, will provide details of the history of each source. And then it’s up to the rest of us.

Software ‘brokers’ will try to filter this unprecedented torrent of

data, sending the cosmologists pristine type 1a supernovae and

planetary scientists a steadily growing list of candidate asteroids

and Kuiper Belt Objects. One of these brokers will be listening

too, but it will have a different job, directing objects to the screens of volunteers around the world.

Alerts will ping on a thousand mobile phones; something has

happened in the sky, and we need your help. By the time the Sun

246 Three PaThs

rises in Chile, tens of thousands or maybe hundreds of thousands

of images will have been inspected by a crowd consisting of both

the astronomically passionate and the mildly curious. Sunrise in

Chile means that it’s nearly night in Australia, and we’ll need to

have identified the most interesting things by the time telescopes

there are opening for the evening.

Maybe the centre of a nearby galaxy has brightened because

something is falling into its black hole. Maybe a slow-moving

object looks like a promising contender to be the latest member

of the swarm of bodies out around Pluto. Maybe we caught a

planet in transit in front of a star, or just the star itself behaving badly. Whatever the case, contributions from people like you

will help determine what happens next. As the Earth spins, tele-

scopes in Hawai’i and the Canary Islands and in South Africa join

in; for the most energetic events, information from space-based

satellites will be added to the mix.

For each of these events, triggering the worldwide network of

observatories to stare at the right place is merely the start.

Understanding what they are telling us will take a lot of time, and

will overwhelm professional astronomers like myself. As data

becomes more open, we’ll see networks of citizen astronomers

spring up to discuss and debate their favourite objects. Some of

the participants are undoubtedly already experts in the field;

some will bring skills that are of great use, and others just a will-

ingness to learn. They will talk to and collaborate with the

increasing number of scientists who have discovered just how

powerful working in this way really is.

Between us, in this best of all possible worlds, we will have

built a new way of exploring the Universe: something that takes

the best features of Galaxy Zoo and Planet Hunters and all the

other projects from the last decade and turns them into some-

thing even more inclusive, more powerful, and above all else

Three PaThs 247

much more fun, as volunteers control not only the discovery

but the investigation of the things that are uncovered. I hope

that if we look back in a decade’s time at twenty years of citizen

science through these projects, there will be a completely new

crop of strange anomalies and curious objects to talk about.

I would be very, very surprised to find myself in any reality other

than this one.

It does, however, need you. You and your ver
y human talent

for pattern recognition and for being distracted from a task. You,

with your curiosity and interest and willingness to spend just a

few moments in your day doing something in contemplation of

the Universe. You, with just a little time to join with millions of

others so that collectively we can all achieve amazing things.

Making the best of our capacity as a species to explore the

Universe, and to understand the world around us, I believe,

depends on finding a way that everyone on the planet can partici-

pate as an active observer and interpreter of the data that’s now

available. If we really can get everyone to join in—even if only for

a few minutes—with this great endeavour, who knows what we

might find, sitting out there and just waiting to be discovered.

REFERENCES

I haven’t tried to give a complete bibliography of works about citizen science, or even about the Zooniverse. An updated and nearly complete list of publica-tions produced by Zooniverse projects is maintained at Zooniverse.org/publications, and the ‘Meta’ category there is a good starting point for those looking for academic studies of what we’ve been up to.

Preface

Christiansen, Jessie L. et al., 2018, The K2–138 System: A Near-Resonant Chain of Five Sub-Neptune Planets Discovered by Citizen Scientists, Astronomical Journal, 155, 2, https://arxiv.org/abs/1801.03874.

Lakdawalla, Emily, 2009, An ‘Amateur’ Discovers Moons in Old Voyager

Images, planetary.org, 5 August, http://www.planetary.org/blogs/emily-

lakdawalla/2009/2035.html.

chaPter 2

Bailey, Jeremy et al., 1998, Circular Polarization in Star-Formation Regions: Implications for Biomolecular Homochirality, Science, 281, 5377, 672.

Finkbeiner, Anne, 2010, A Grand and Bold Thing, Free Press. Provides a history of the Sloan Digital Sky Survey project.

Ivezić, Željko et al., 2018, LSST: From Science Drivers to Reference Design and Anticipated Data Products, https://arxiv.org/abs/0805.2366.

Reid, David and Chris Lintott, 1996, Astronomy at Torquay Boys’ Grammar School, Journal of the British Astronomical Association, 5, 265, http://articles.

adsabs.harvard.edu/full/1996JBAA..106..265R.

chaPter 3

Land, Kate et al., 2008, Galaxy Zoo: The Large-Scale Spin Statistics of Spiral Galaxies in the Sloan Digital Sky Survey—Clockwise and Anticlockwise

Galaxies, Monthly Notices of the Royal Astronomical Society, 388, 1686.

‹ Prev Next ›