Getting Tweet Content Examples

Oxford Spring School, 2022

Christopher Barrie

Introduction

Why get tweets?
- Testing API call (again…)
- We can investigate text beyond keywords
- We can investigate additional tweet characteristics (metrics, annotations)
- We can investigate user characteristics…

Getting tweets

As barometer of public opinion
- Coverage?
- Bias?
- Advantages versus surveys?
  - Real-time (“always on”)
  - More accurately/realistically measured (?)

Sampling tweets

By keyword

Geolocating tweets

By user-reported location

Getting tweets

As measure of individual attitudes

Sampling tweets

(Here) reliance on hacked lists
Continued streaming

Sampling tweets

Design considerations:

Attrition via takedown
Tweet honesty?
Alternative sampling strategy?
Geolocation…

Geolocating tweets

Very small percentage of tweets are geolocated (0.1–1%)
Have to rely on coarse-grained location
Or infer from “friends” (whom an individual follows)
- Blunt force: frequency of locations of friends network
- More precise: spatial label propagation
  - Get location of friends
    - Get locations of friends of friends…
  - Predict user location from distribution
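
The two friend-based approaches above can be sketched roughly as follows. This is a simplified illustration, not the method from any particular paper: it treats locations as place-name strings and takes a majority vote, whereas published spatial label propagation variants generally work with coordinates. The follow graph, user names, and the helper names `most_common_location` and `propagate_locations` are all hypothetical.

```python
from collections import Counter


def most_common_location(locations):
    """Blunt force: return the most frequent non-empty location, or None."""
    counts = Counter(loc for loc in locations if loc)
    return counts.most_common(1)[0][0] if counts else None


def propagate_locations(friends, known, rounds=2):
    """Label unlocated users with the most common location among the
    accounts they follow, repeating so labels can spread from friends
    of friends. `friends` maps user -> list of followed accounts."""
    labels = dict(known)
    for _ in range(rounds):
        updates = {}
        for user, followed in friends.items():
            if user in labels:
                continue  # keep locations we already have
            guess = most_common_location(labels.get(f) for f in followed)
            if guess is not None:
                updates[user] = guess
        if not updates:
            break  # nothing new to propagate
        labels.update(updates)
    return labels


# Toy data (made up): "ana" follows only unlocated accounts, so her
# label can only arrive via "bo" in the second round.
friends = {
    "ana": ["bo", "fi"],
    "bo": ["cy", "di", "ed"],
}
known = {"cy": "Leeds", "di": "Leeds", "ed": "Oxford"}

labels = propagate_locations(friends, known)
```

With a single round only `bo` would be labelled; the second round is what lets information from friends of friends reach `ana`.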

Coding tweets

Coding criteria
MTurk
1. Use labels as outcome
2. Use to train supervised ML algorithm
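
Before crowd labels are used as an outcome or as training data, they usually need to be collapsed to one label per tweet. A minimal sketch of one way to do this, assuming several MTurk coders label each tweet and a simple majority rule is acceptable; the function name, the label values, and the agreement threshold are illustrative assumptions, not part of the original design.

```python
from collections import Counter, defaultdict


def aggregate_labels(annotations, min_agreement=0.5):
    """Collapse (tweet_id, label) pairs into one label per tweet.

    A tweet keeps its most common label only if the share of coders
    choosing it is strictly greater than `min_agreement`; otherwise
    it is marked None (no consensus)."""
    by_tweet = defaultdict(list)
    for tweet_id, label in annotations:
        by_tweet[tweet_id].append(label)

    result = {}
    for tweet_id, labels in by_tweet.items():
        label, votes = Counter(labels).most_common(1)[0]
        result[tweet_id] = label if votes / len(labels) > min_agreement else None
    return result


# Made-up annotations: three coders on t1, two coders on t2.
annotations = [
    ("t1", "negative"), ("t1", "negative"), ("t1", "positive"),
    ("t2", "positive"), ("t2", "negative"),
]
consensus = aggregate_labels(annotations)
```

Tweets with no consensus (here `t2`) could be dropped, sent for more coding, or reviewed by hand, depending on the coding criteria.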

Getting tweets

As a measure of popularity/engagement
- Accurate measure of human attention?
- Accurate measure of endorsement?

Group work

Replicate Flores (2017) using get_all_tweets() and another event of interest
Formulate code needed to request that data
(If access granted) make the API call to get data
Consider limitations + design considerations
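
As a starting point for formulating the request, the sketch below only assembles the parameters for a full-archive search; it does not send anything. It assumes the Twitter API v2 full-archive search endpoint and its parameter names as documented at the time of the course (access terms may have changed since). The keywords, dates, requested fields, and the choice to exclude retweets are placeholders to adapt to the event you pick, and `build_search_params` is a hypothetical helper.

```python
from datetime import datetime, timezone

# Assumed endpoint: Twitter API v2 full-archive search.
SEARCH_ALL_URL = "https://api.twitter.com/2/tweets/search/all"


def build_search_params(keywords, start, end, max_results=100):
    """Build query parameters for one page of a full-archive search."""
    if not 10 <= max_results <= 500:
        raise ValueError("max_results must be between 10 and 500")

    # Quote multi-word phrases so they are matched exactly.
    terms = [f'"{k}"' if " " in k else k for k in keywords]
    # Excluding retweets is an assumption; drop the operator to keep them.
    query = "(" + " OR ".join(terms) + ") -is:retweet"

    fmt = "%Y-%m-%dT%H:%M:%SZ"  # timestamps are sent in UTC
    return {
        "query": query,
        "start_time": start.astimezone(timezone.utc).strftime(fmt),
        "end_time": end.astimezone(timezone.utc).strftime(fmt),
        "max_results": max_results,
        "tweet.fields": "created_at,public_metrics,geo,lang",
    }


# Placeholder event keywords and date window (not from any study).
params = build_search_params(
    ["example event", "#example"],
    start=datetime(2022, 1, 1, tzinfo=timezone.utc),
    end=datetime(2022, 2, 1, tzinfo=timezone.utc),
)
```

Writing the parameters out like this makes the design choices explicit (keyword list, time window, retweets in or out) before any quota is spent on the actual call.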