Though early intervention and treatment is linked to improved long-term health, few tools currently exist to
identify people who are at early risk of developing schizophrenia or to predict who will go on to develop
psychosis. The Accelerating Medicines Partnership (AMP®) SCZ program aims to generate tools that will
fast-track the development of effective, early-stage treatments for people who are at risk for
schizophrenia. For more information about the AMP SCZ Study, including protocols, please visit the AMP SCZ Study website.
The AMP-SCZ Release 3.0 includes quality-controlled data from screening through month 2 for 2192
unique subjects. This includes:
-
Clinical and behavioral measures, with baseline data from 1417 to 2101 subjects, month 1
from 1032 to 1234 subjects, and month 2 from 1120 to 1419 subjects, depending on the
measure.
-
Unprocessed electroencephalography (EEG) data include 1454 subjects at baseline and 782
at month 2, as well as EEG task-related and resting-state-derived measures for 1352
participants at baseline and 744 participants at month 2.
-
Unprocessed magnetic resonance imaging (MRI, dMRI, and rs-fMRI) data cover 1159
participants at baseline and 493 participants at month 2.
-
Redacted open interview transcripts include language samples collected at baseline for
766 participants and at month 2 for 423 participants using the Open-ended Language
Sample protocol.
-
Additionally, language samples collected using the PSYCHS clinical interview include
those from 659 participants at screening, 273 at baseline, 159 at month 1, and 86 at
month 2.
-
Raw actigraphy watch data are available for 461 participants, with derived measures of
sleep time, sleep quality, and daily activity available for 450 participants.
Additionally, this release includes smartphone surveys for 926 participants and
smartphone data (including screen use, accelerometry, and geolocation) for 539
participants.
-
Fluid biomarkers are available at baseline for 1746 to 1903 participants and month 2 for
1122 to 1240 participants.
Please refer to the spreadsheet "ampscz-release-3.0-record.csv"
under files for data availability for each measure per subject. For detailed information on NDA variable short names, please refer
to the "nda_vars_20250411.csv" codebook under files.
The behavioral data package includes demographic, physical health, clinical assessments, and
cognitive and behavioral data for participants at screening, baseline, month 1, and month 2. The
AMP SCZ Data Reference Manual for Release 3.0
describes the overall contents of various instruments used in the AMP SCZ project. We recommend downloading
the behavioral package to understand what is available before downloading other packages.
The unprocessed 64 channels EEG data for baseline and month 2, from 5 different paradigm types
(Combined Visual Oddball (VOD) with Mismatch Negativity (MMN), Auditory Oddball (AOD), Auditory
Steady-State Response (ASSR), Resting-State w/ Eyes Open, and Resting-State w/ Eyes Closed are
included in the EEG data package. Sessions that are compliant with standard operating procedures
(SOPs) and have high-quality control ratings are included in this release.
EEG task-related and resting state-derived measures were included. For task-related measures, we
reported average amplitudes of ERP difference waves at each of 64 channels for the MMN and oddball
tasks, as well as average changes in total and evoked power relative to a baseline interval for the
ASSR task. For resting state-derived measures, the mean absolute power spectral density in 5
frequency bands of 64 channels during 180-second resting EEG recordings is provided, in both
eyes-open and eyes-closed conditions.
The unprocessed MRI data from the following modalities are included in this release: structural T1- and
T2-weighted (T1w and T2w), diffusion MRI (dMRI), and resting state functional scans (rs-fMRI).
MRI derived measures included the average diffusion measures from an in-house Tract-Based Spatial
Statistics (TBSS) pipeline and FreeSurfer reconstructions. FreeSurfer reconstructions are available only
for sessions with both T1w and T2w scans. Average diffusion measures are of Johns Hopkins
University white matter regions of interest using TBSS and were not harmonized for site or scanner
differences, though diffusion MRI harmonization is planned for future releases.
Language samples were collected from screening through month 2 using open-ended interviews and
monthly using the PSYCHS clinical interview. The fully redacted transcripts are available in the
Transcripts data package.
Derived language measures include 8 speech fluency and verbosity features, along with 102 linguistic
features, categorized into morpho-syntactic properties, parts of speech, and grammatical
dependencies. Derived measures now track the total number of sentences in a transcript and average
word frequencies separately for interviewers and interviewees.
The raw accelerometer data obtained from the actigraphy watch and smartphone data acquired from
short daily self-report surveys and passive data (accelerometer and screen state) are available in the
Actigraphy & Phone Data package.
Actigraphy derived measures for minute-based activity scores and daily sleep and activity measures
were included. These measures were processed using DPSleep (Deep Phenotyping of Sleep Open
Source pipeline), which analyzes raw accelerometer data to extract sleep and activity parameters
based on minute-level activity levels.
The raw smartphone geolocation data is available in the Phone Geolocation Data package.
Note that accessing the geolocation data requires additional IRB approval.
Downloading Data
Select one or more available datasets in the nested dropdowns below to add subsets of the data to your
Workspace and Filter Cart. Click on the nested dropdowns to explore all the datasets and on the “i” icons
for descriptions. When Creating a Data Package, select "include associated files" to include imaging files
in download packages.
Note: Some datasets are larger than the monthly NDA download limit of 20 Terabytes. To
download datasets over 20TB without delays, end users can request a temporary Download Threshold extension
from the NDA Help Desk at NDAHelp@mail.nih.gov. In your request,
name the data you need to download e.g., AMP SCZ 3.0 resting state functional scans (rs-fMRI) data) and the
data package size. See NDA's User Download
Threshold page for more information.
The "Tips" box on the right provides further instructions and details the remaining steps to select and
download data.