Skip to content

ehrQL backend tables

OpenSAFELY provides secure access to real data via different OpenSAFELY backends.

For example, there is a TPP backend, for querying patient data held by TPP.

The tables that are available for use in ehrQL depend on which OpenSAFELY backend that you access.

We can consider two kinds of table:

  • core tables
  • backend-specific tables

Core tables🔗

If you have read these documentation pages in the suggested order, then all the examples you will have seen access the core tables.

The core tables are intended to be compatible with any OpenSAFELY backend providing primary care data. If you only use the core tables, then your ehrQL dataset definition will be compatible with different OpenSAFELY backends, without requiring any changes.

Importing core tables🔗

Make the core tables available for use in a dataset definition with import statements like:

from ehrql.tables.core import medications, patients

where the ehrql.tables.core specifies that we are using the core tables.

Backend-specific tables🔗

Different OpenSAFELY backends may opt to provide additional data tables other than the core tables. You may wish to access these tables if the core tables are not sufficient to answer the research question of interest.

⚠ When using backend-specific tables in a dataset definition, your dataset definition will be only compatible with that backend.

🗒 Implementation of other backends is still in development. But all data in the EMIS OpenSAFELY backend is available via cohort-extractor.

Importing backend-specific tables🔗

Instead of referring to core in the import statement, we use the name of the backend.

For example, for TPP-specific tables, we use tpp in the import statement:

from ehrql.tables.tpp import addresses, patients

🗒 In this example, the addresses table is specific to the TPP backend. The patients table is a core ehrQL table. We import both core and backend-specific tables within a single import statement.

Using the table schema reference🔗

In the examples given so far, the names of the table schemas, tables and columns have been provided for you.

For example, in the initial dataset definition of the tutorial, we had the following line:

from ehrql.tables.core import patients, practice_registrations, clinical_events, medications
  • core is the table schema
  • patients, practice_registrations, clinical_events and medications are the table names

We also used the show() command to display the patients table and saw that the table columns available were date_of_birth, sex and date_of_death.

Use the table schema reference to look up which schemas and columns are available.

The table schema reference explains:

  • which backends support the schema
  • the table column names
  • the table column data types
  • any additional constraints on table column values
  • additional contextual information about table columns
  • whether table columns contain at most one row per patient, or may contain multiple rows per patient

❔ Consult the core schema. Choose any of the tables there and understand its structure from the schema.