For a fact based worldview

Preconceived ideas

Preconceived ideas

Outdated worldviews from earlier education

Preconceived ideas

Brains are biased towards survival

Preconceived ideas

Media amplifies our biases

Factfulness

The lovely, calm mental state reached when
you know your opinions are based on facts

Factfulness

Gapminder creates educational material

Factfulness

Facts

Factfulness

Fight biases in the media and your brain

Statistics are great facts

Data Description Format

Multi-dimensional collaborative statistics

Datapoints

country year population life_expectancy gdp_per_capita
swe 2015 9.78M 82.1 44.9k
usa 2015 322M 79.1 53.4k
ger 2015 80.7M 80.8 44.1k
swe 2014 9.7M 82.1 44k
... ... ... ... ...


country year co2_pcap
swe 2013 4.61
usa 2013 16.4
ger 2013 9.4
swe 2012 4.93
... ... ...

Entities

country name world_region latitude longitude
swe Sweden europe 62 15
usa United States americas 39.76 -98.5
ger Germany europe 51.5 10.5
... ... ... ... ...


world_region name shape
americas The Americas { ... }
europe Europe { ... }
asia Asia { ... }
africa Africa { ... }

Concepts

concept scales name unit description
country Country
year ["time"] Year Calendar year
life_expectancy ["linear"] Life expectancy years The average number of years a newborn child would live...
gdp_per_capita ["log","linear"] Income per person GDP/capita in $/year adjusted for inflation & prices Gross Domestic Product per person ...
... ... ... ... ...

Datapoints

country year population life_expectancy gdp_per_capita
swe 2015 9.78M 82.1 44.9k
usa 2015 322M 79.1 53.4k
ger 2015 80.7M 80.8 44.1k
swe 2014 9.7M 82.1 44k
... ... ... ... ...


Concepts

concept scales name unit description
country Country
year ["time"] Year Calendar year
life_expectancy ["linear"] Life expectancy years The average number of years a newborn child would live...
gdp_per_capita ["log","linear"] Income per person GDP/capita in $/year adjusted for inflation & prices Gross Domestic Product per person ...
... ... ... ... ...

Entities

country name world_region latitude longitude
swe Sweden europe 62 15
usa United States americas 39.76 -98.5
ger Germany europe 51.5 10.5
... ... ... ... ...


world_region name shape
americas The Americas { ... }
europe Europe { ... }
asia Asia { ... }
africa Africa { ... }

Basic DDF

Entity sets

Entity sets

geo name is--country is--un_state is--city
swe Sweden true true false
twn Taiwan true false false
sgp Singapore true true true
sth Stockholm false false true
pacific Pacific Ocean false false false
... ... ... ... ...


Concepts

concept concept_type domain
geo entity_domain
country entity_set geo
un_state entity_set geo
city entity_set geo

Entity sets: one entity, multiple sets

Entities

country name world_region latitude longitude
swe Sweden europe 385 85
sgp Singapore asia 583 75
sth Stockholm europe 484 84
... ... ... ... ...


Datapoints

country year population gdp_per_capita
swe 2015 385 3858575
usa 2015 583 385839
ger 2015 484 5894
swe 2014 200 4839595
... ... ... ...

Entity sets: one entity, multiple sets

UN states Countries

Entity sets: one entity, multiple sets

Multi-dimensionality

Datapoints

country year population
swe 2015 385
usa 2015 583
swe 2014 200
... ... ...


country year age population
swe 2015 0 200
swe 2015 1 185
... ... ... ...


country year age gender population


country year gender age educ population

Multi-dimensionality

Translations

Concepts

/ddf--concepts.csv
concept scales name description
country Country
year ["time"] Year Calendar year
life_expectancy ["linear"] Life expectancy Expected length of life ...
gdp_per_capita ["log","linear"] Income per person Gross Domestic Product per ...


Entities

/ddf--entities--geo.csv
geo name long_name
swe Sweden The Kingdom of Sweden
ger Germany Federal Republic of Germany

Concepts

/lang/ar-SA/ddf--concepts.csv
concept name description
country

الدولة

year

الوقت

تقويم سنوي

life_expectancy

متوسط العمر المتوقع

متوسط عدد السنوات التي قد يعيشها طفل حديث الولادة في حال بقيت معدلات وفيات الأطفال الحالية كما هي.

gdp_per_capita

نصيب الفرد من الدخل

إجمالي الناتج المحلي للفرد معدل حسب اختلاف القوة الشرائية (تعادل القوة الشرائية استناداً إلى برنامج المقارنات الدولية لعام 2011).



Entities

/lang/ar-SA/ddf--entities--geo.csv
geo name long_name
swe

السويد

مملكة السويد

ger

ألمانيا

جمهورية ألمانيا الاتحادية

Translations

Each string concept can be translated

DDF and other data formats

Data Pipeline

Data Pipeline

Data Pipeline

Data kitchen example


    ingredients:
        - id: cdiac-entities
          dataset: ddf--cdiac--co2
          key: nation
          value: "*"
        - id: gapminder-entities
          dataset: ddf--gapminder--geo_entity_domain
          key: country
          value: "*"

    cooking:
        datapoints:
            - procedure: translate_column
              ingredients:
                  - cdiac-entities
              options:
                  column: name
                  target_column: geo
                  dictionary:
                      key: ['name', 'gapminder_list','alternative_1', 'alternative_2', 'alternative_3',
                            'alternative_4_cdiac', 'pandg', 'god_id', 'alt_5', 'upper_case_name', 'arb1', 
                            'arb2', 'arb3', 'arb4', 'arb5', 'arb6'] 
                      value: country
                      base: gapminder-entities
                  not_found: drop
                  ignore_case: true
                  debug: true
              result: geo-entities-translated
        

Data Pipeline

Recipe code example

Data Pipeline

Recipe diagram

Open Numbers

Crowd-sourced harmonization of global & local statistics

Let's build a fact base together!