Online appendix for

“Extracting Development Tasks to Navigate Software Documentation”

1. Regular expression for the detection of code elements

[A-Z][a-zA-Z]+ ?<[A-Z][a-zA-Z]*>

[a-zA-Z0-9\.]+[(][a-zA-Z_,\.]*[)]

(https?://)?[a-zA-Z_\\-/]{2,}(\.[a-zA-Z_0-9\\-]{2,})+[^\s\<\>{\(\),'\"”’}:]*

([\.]?[/]?\w+\.\w+\.?\w+(?:\.\w+)*)

[A-Za-z]+\.[A-Z]+

[@][a-zA-Z]+

(?:\s|^)([a-zA-z]{3,}\.[A-Za-z]+_[a-zA-Z_]+)

\b([A-Z]{2,})\b

(?:\s|^)([A-Z]+_[A-Z0-9_]+)

(?:\s|^)([a-z]+_[a-z0-9_]+)

\w{3,}:\w+[a-zA-Z0-9:]*

(?:\s|^)([A-Z]+[a-z0-9]+[A-Z][a-z0-9]+\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)([A-Z]{3,}[a-z0-9]{2,}\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)([a-z0-9]+[A-Z]+\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)(\w+\([^)]*\))(\s|\.\s|\.$|$|,\s)

([A-Z][a-z]+[A-Z][a-zA-Z]+)(\s|,|\.|\))

(?:\s|^)([a-z]+[A-Z][a-zA-Z]+)(\s|,|\.|\))

(?:\s|^)([A-Z]+[a-z0-9]+[A-Z][a-z0-9]+\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)([A-Z]{3,}[a-z0-9]{2,}\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)([a-z0-9]+[A-Z]+\w*)(\s|\.\s|\.$|$|,\s)

(?:\s|^)(\w+\([^)]*\))(\s|\.\s|\.$|$|,\s)

([A-Z][a-z]+[A-Z][a-zA-Z]+)(\s|,|\.|\))

([a-z]+[A-Z][a-zA-Z]+)(\s|,|\.|\))

([a-z] )([A-Z][a-z]{3,11})( )

</?[a-zA-Z0-9 ]+>

\{\{[^\}]*\}\}

\{\%[^\%]*\%\}

/[^/]*/

‘[^’]*’

__[^_]*__

\$[A-Za-z\_]+

2. Verbs considered as programming actions

access

acquire

activate

add

adjust

align

append

apply

archive

ask

assign

attach

bind

book

boost

bootstrapp

break

cache

calculate

call

catch

center

change

check

checkout

choose

clean

clone

combine

compare

compile

complete

compose

compress

compute

configure

convert

count

create

crop

customize

deactivate

defer

define

delete

deploy

describe

design

determine

develop

differentiate

disable

discontinue

display

divide

document

download

duplicate

edit

embed

enable

encapsulate

encode

enter

exclude

execute

expand

expose

extend

fetch

fill

find

fit

fix

flush

force

format

generate

get

group

handle

hardcode

hide

host

identify

ignore

implement

import

include

indent

inherit

initialize

insert

install

instantiate

integrate

introduce

isolate

learn

limit

link

list

load

locate

log

log in

login

manage

manipulate

mark

match

mock

modify

move

multiply

obtain

offer

omit

open

order

output

override

overwrite

pass

paste

patch

perform

place

play

prefix

prepare

present

prevent

price

print

process

produce

provide

purchase

push

raise

reach

read

rebuild

receive

recompile

recreate

redefine

redirect

refactor

reference

regenerate

release

remove

rename

render

repeat

replace

replicate

request

resolve

restart

retrieve

return

reuse

run

save

scale

search

select

sell

send

separate

seperate

set

set up

share

shorten

show

sign

simulate

skip

sort

specify

split

store

submit

subscribe

summarize

support

surround

switch

test

throw

track

translate

trigger

trim

update

use

wrap

write

3. Objects considered to be too generic

a

addition

all

any

anything

each

everything

example

functionality

it

manner

more

much

one

ones

order

other

own

problem

some

that

them

these

they

thing

this

us

using

what

which

you

your

yourself

4. Domain terms to be tagged as nouns

backend

css

decorator

file

font color

html

ie

internet explorer 8

javascript

meta data

meta info

nose

parameter

parameters

po file

python

slug

template

templates

[product name]

[product name]

[product name]

[product specific concept]

[product specific concept]

[product specific concept]