NIMH Data Archive - Data

New Project
Grant/Project Number

Enter the grant number in the following format: Activity Code, Institute Abbreviation, and Serial Number. Exclude the Grant Type, Support Year, and Suffix. For example, grant 1R01MH123456-01A1 should be entered as R01MH123456.
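
The entry rule above can be sketched in Python. This is a minimal illustration, not NDA's own validation: the function name `normalize_grant_number` and the regular expression are assumptions, based on the common NIH layout of a one-digit grant type, a three-character activity code, a two-letter institute abbreviation, a six-digit serial number, and an optional support year and suffix.

```python
import re

# Assumed layout: [type] activity institute serial [-year[suffix]]
# e.g. 1 R01 MH 123456 -01 A1. This pattern is an illustration only.
_GRANT_RE = re.compile(
    r"^(?P<type>\d)?"
    r"(?P<activity>[A-Z][A-Z0-9]{2})"
    r"(?P<institute>[A-Z]{2})"
    r"(?P<serial>\d{6})"
    r"(?:-(?P<year>\d{2})(?P<suffix>[A-Z0-9]*))?$"
)


def normalize_grant_number(raw: str) -> str:
    """Return Activity Code + Institute Abbreviation + Serial Number."""
    # Drop internal whitespace so "N01 MH090003-02" is handled too.
    compact = re.sub(r"\s+", "", raw).upper()
    match = _GRANT_RE.match(compact)
    if match is None:
        raise ValueError(f"unrecognized grant number: {raw!r}")
    return match["activity"] + match["institute"] + match["serial"]


print(normalize_grant_number("1R01MH123456-01A1"))
```

A value that is already in the short form, such as R01MH123456, passes through unchanged.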

Collection - Use Existing Experiment

To associate an experiment with the current collection, select an experiment from the table below, then click the associate experiment button to persist your changes (saving the collection is not required). Note that once an experiment has been associated with two or more collections, the experiment will no longer be editable.

The table search feature is case-insensitive and targets the Experiment Id, Experiment Name, and Experiment Type columns. The Experiment Id is searched only when the search term is a number, using a startsWith comparison. When the search term is not numeric, the Experiment Name is used to filter the results.
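
The search behavior described above can be sketched as follows. This is an illustration under stated assumptions, not the page's actual implementation: the `Experiment` class and `search_experiments` function are hypothetical names, and the name filter is assumed to be a case-insensitive substring match, which the text does not specify.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass(frozen=True)
class Experiment:
    experiment_id: int
    name: str
    experiment_type: str


def search_experiments(experiments: Iterable[Experiment], term: str) -> List[Experiment]:
    """Filter experiments the way the table search is described."""
    rows = list(experiments)
    term = term.strip()
    if not term:
        return rows
    if term.isdecimal():
        # Numeric term: prefix ("startsWith") match on the experiment id.
        return [e for e in rows if str(e.experiment_id).startswith(term)]
    # Non-numeric term: case-insensitive match on the experiment name
    # (substring matching is an assumption).
    needle = term.casefold()
    return [e for e in rows if needle in e.name.casefold()]


rows = [
    Experiment(24, "HI-NGS_R1", "Omics"),
    Experiment(501, "PharmacoBOLD Resting State", "fMRI"),
    Experiment(506, "PVPREF", "Omics"),
    Experiment(519, "Resting", "fMRI"),
]
print([e.experiment_id for e in search_experiments(rows, "50")])
print([e.experiment_id for e in search_experiments(rows, "resting")])
```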

Experiment Id	Experiment Name	Experiment Type	Created On

24	HI-NGS_R1	Omics	02/16/2011
475	MB1-10 (CHOP)	Omics	06/07/2016
490	Discovery and CRISPR validation of genetic factors associated with antipsychotic-induced weight gain and cardiometabolic risk	Omics	07/07/2016
501	PharmacoBOLD Resting State	fMRI	07/27/2016
506	PVPREF	Omics	08/05/2016
509	ABC-CT Resting v2	EEG	08/18/2016
13	Comparison of FI expression in Autistic and Neurotypical Homo Sapiens	Omics	12/28/2010
18	AGRE/Broad Affymetrix 5.0 Genotype Experiment	Omics	01/06/2011
22	Stitching PCR Sequencing	Omics	02/14/2011
26	ASD_Methylation	Omics	03/01/2011
29	Microarray family 03 (father, mother, sibling)	Omics	03/24/2011
37	Standard paired-end sequencing of BCRs	Omics	04/19/2011
38	Illumina Mate-Pair BCR sequencing	Omics	04/19/2011
39	Custom Jumping Libraries	Omics	04/19/2011
40	Custom CapBP	Omics	04/19/2011
41	Immunofluorescence	Omics	05/11/2011
43	Autism brain sample genotyping, Illumina	Omics	05/16/2011
47	ARRA Autism Sequencing Collaboration at Baylor. SOLiD 4 System	Omics	08/01/2011
53	AGRE Omni1-quad	Omics	10/11/2011
59	AGP genotyping	Omics	04/03/2012
60	Ultradeep 454 sequencing of synaptic genes from postmortem cerebella of individuals with ASD and neurotypical controls	Omics	06/23/2012
63	Microemulsion PCR and Targeted Resequencing for Variant Detection in ASD	Omics	07/20/2012
76	Whole Genome Sequencing in Autism Families	Omics	01/03/2013
519	Resting	fMRI	11/08/2016
90	Genotyped IAN Samples	Omics	07/09/2013
91	NJLAGS Axiom Genotyping Array	Omics	07/16/2013
93	AGP genotyping (CNV)	Omics	09/06/2013
106	Longitudinal Sleep Study. H20 200. Channel set 2	EEG	11/07/2013
107	Longitudinal Sleep Study. H20 200. Channel set 3	EEG	11/07/2013
108	Longitudinal Sleep Study. AURA 200	EEG	11/07/2013
105	Longitudinal Sleep Study. H20 200. Channel set 1	EEG	11/07/2013
109	Longitudinal Sleep Study. AURA 400	EEG	11/07/2013
116	Gene Expression Analysis WG-6	Omics	01/07/2014
131	Jeste Lab UCLA ACEii: Charlie Brown and Sesame Street - Project 1	Eye Tracking	02/27/2014
132	Jeste Lab UCLA ACEii: Animacy - Project 1	Eye Tracking	02/27/2014
133	Jeste Lab UCLA ACEii: Mom Stranger - Project 2	Eye Tracking	02/27/2014
134	Jeste Lab UCLA ACEii: Face Emotion - Project 3	Eye Tracking	02/27/2014
145	AGRE/FMR1_Illumina.JHU	Omics	04/14/2014
146	AGRE/MECP2_Sanger.JHU	Omics	04/14/2014
147	AGRE/MECP2_Junior.JHU	Omics	04/14/2014
151	Candidate Gene Identification in familial Autism	Omics	06/09/2014
152	NJLAGS Whole Genome Sequencing	Omics	07/01/2014
154	Math Autism Study - Vinod Menon	fMRI	07/15/2014
155	Resting	fMRI	07/25/2014
156	Speech	fMRI	07/25/2014
159	Emotion	fMRI	07/25/2014
160	syllable contrast	EEG	07/29/2014
167	School-age naturalistic stimuli	Eye Tracking	09/19/2014
44	AGRE/Broad Affymetrix 5.0 Genotype Experiment	Omics	06/27/2011
45	Exome Sequencing of 20 Sporadic Cases of Autism Spectrum Disorder	Omics	07/15/2011

Collection - Add Experiment

Add Supporting Documentation

Funding Source:
URL:

To add an existing Data Structure, enter its title in the search bar. If you need to request changes, select the indicator "No, it requires changes to meet research needs" after selecting the Structure, and upload the file with the requested changes specific to the selected Data Structure. Your file should follow the Request Changes Procedure. If the Data Structure does not exist, select "Request New Data Structure" and upload the appropriate zip file.

Use/Modify Existing Data Structure

Request New Data Structure

Targeted Enrollment:

Initial Submission Date:

Initial Share Date:

Data Structure Search:

Data Structures:

Submit

Request Submission Exemption

Not Eligible

The Data Expected list for this Collection shows some raw data as missing. Contact the NDA Help Desk with any questions.

Please confirm that you will not be enrolling any more subjects and that all raw data has been collected and submitted.

Collection Updated

Your Collection is now in Data Analysis phase and exempt from biannual submissions. Analyzed data is still expected prior to publication or no later than the project end date.

[CMS] Error

Unable to change collection phase where targeted enrollment is less than 90%

You have requested to move the sharing dates for the following assessments:

Data Expected Item	Original Sharing Date	New Sharing Date

Please provide a reason for this change, which will be sent to the Program Officers listed within this collection:

Explanation must be between 20 and 200 characters in length.

Please press Save or Cancel

Combining Medications to Enhance Depression Outcomes (CO-MED) #2158

General
Experiments (0)
Shared Data
Publications (0)
Associated Studies (7)

Collection Title	Collection Investigators	Collection Description
Collection Title:	Combining Medications to Enhance Depression Outcomes (CO-MED)
Collection Investigators:	Madhukar H. Trivedi
Collection Description:	The overall aim of Combining Medications to Enhance Depression Outcomes (CO-MED) is to enhance remission rates for outpatients with chronic or recurrent nonpsychotic major depressive disorder (MDD) as defined by DSM-IV-TR, treated in primary or psychiatric care settings. Current evidence indicates that remission, the goal of treatment, is found in only about one-third of representative depressed outpatients treated for up to 14 weeks with an initial SSRI. In addition, even for those who do respond or remit, over one-third relapse in the subsequent 12 months. Combinations of antidepressants are used in practice at the second or subsequent steps when relapse occurs in the longer term, or, in some cases, even acutely as a first step when speed of effect is a clinical priority. Whether such combinations could potentially offer higher remission rates, lower attrition, or greater longer-term benefit if used as initial treatments as compared to monotherapy remains to be examined. CO-MED will test whether two different medications when given in combination as the first treatment step, compared to one medication, will enhance remission rates, increase speed of remission, be tolerable, and provide better sustained benefits in the longer term. Results of this study will inform practitioners in managing the treatment of patients with chronic or recurrent MDD.
Data Repository:	NIMH Data Archive
Permission Group:
Collection Creation Date:	03/31/2015
NIH Research Initiative:	NIMH Repository & Genomics Resource (NRGR)
Collection Phase:	Funding Completed
Collection Sub-Phase:	Close Out
Blinded Clinical Trial:	No
Subjects Shared:	816
Collection DOI:	10.15154/5nth-4y60

Funding Sources:

Funding Source Name	Funding Source URL
NIH - Contract	None

Supporting Documentation:

File Name	File Type	Description	Audience
CO-MED_Protocol.pdf	Methods	Protocol	Qualified Researchers
CO-MED_Published_Reports.pdf	Publication	Published Reports	Qualified Researchers
CO-MED_Table_of_Forms.pdf	Methods	Schedule of Assessments	Qualified Researchers
CO-MED_data_oddities.pdf	Other	Data Oddities	Qualified Researchers
CO-MED_Overview.pdf	Background	Overview	Qualified Researchers
Document_Tracking_Form.docx	Other	Document Tracking Form	Qualified Researchers

Grant Information:

Clinical Trials:

Brief Summary	Status	Clinical Trial ID	Study ID	Principal Investigator	Start Date	End Date
This study will compare whether a combination of antidepressant medications is better than one antidepressant medication alone when given as initial treatment for people with chronic or recurrent major depressive disorder.	Completed	NCT00590863	N01 MH090003-02	Madhukar H. Trivedi, MD	March 2008	September 2009

Collection - General Tab

Fields available for edit on the top portion of the page include:

Collection Title
Investigators
Collection Description
Collection Phase
Funding Source
Clinical Trials

Collection Phase: The current status of a research project submitting data to an NDA Collection, based on the timing of the award and/or the data that have been submitted.

Pre-Enrollment: The default entry made when the NDA Collection is created.
Enrolling: Data have been submitted to the NDA Collection or the NDA Data Expected initial submission date has been reached for at least one data structure category in the NDA Collection.
Data Analysis: Subject level data collection for the research project is completed and has been submitted to the NDA Collection. The NDA Collection owner or the NDA Help Desk may set this phase when they’ve confirmed data submission is complete and submitted subject counts match at least 90% of the target enrollment numbers in the NDA Data Expected. Data submission reminders will be turned off for the NDA Collection.
Funding Completed: The NIH grant award (or awards) associated with the NDA Collection has reached its end date. NDA Collections in Funding Completed phase are assigned a subphase to indicate the status of data submission.
- The Data Expected Subphase indicates that NDA expects more data will be submitted.
- The Closeout Subphase indicates the data submission is complete.
- The Sharing Not Met Subphase indicates that data submission was not completed as expected.
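
The 90% enrollment criterion for the Data Analysis phase described above can be written as a small check. This is a sketch only: the function name `meets_enrollment_threshold` is hypothetical, and treating exactly 90% as sufficient follows the "at least 90%" wording.

```python
def meets_enrollment_threshold(subjects_submitted: int, target_enrollment: int) -> bool:
    """Return True when submitted subjects are at least 90% of the target."""
    if target_enrollment <= 0:
        raise ValueError("target_enrollment must be positive")
    if subjects_submitted < 0:
        raise ValueError("subjects_submitted must not be negative")
    # Integer arithmetic avoids floating-point rounding at the 90% boundary.
    return subjects_submitted * 100 >= target_enrollment * 90


print(meets_enrollment_threshold(90, 100))
print(meets_enrollment_threshold(89, 100))
```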

Blinded Clinical Trial Status:

This status is set by a Collection Owner and indicates the research project is a double blinded clinical trial. When selected, the public view of Data Expected will show the Data Expected items and the Submission Dates, but the targeted enrollment and subjects submitted counts will not be displayed.
Targeted enrollment and subjects submitted counts are visible only to NDA Administrators and to the NDA Collection Owner.
When an NDA Collection that is flagged Blinded Clinical Trial reaches the maximum data sharing date for that Data Repository (see https://nda.nih.gov/nda/sharing-regimen.html), the embargo on Data Expected information is released.

Funding Source

The organization(s) responsible for providing the funding is listed here.

Supporting Documentation

Users with Submission privileges, as well as Collection Owners, Program Officers, and those with Administrator privileges, may upload and attach supporting documentation. By default, supporting documentation is shared with the general public; however, it can also be limited to qualified researchers only.

Grant Information

Identifying details are displayed about the Project from which the Collection was derived. You may click the Project Number to view a full report of the Project as captured by NIH.

Clinical Trials

Any data that is collected to support or further the research of clinical studies will be available here. Collection Owners and those with Administrator privileges may add new clinical trials.

Frequently Asked Questions

How does the NIMH Data Archive (NDA) determine which Permission Group data are submitted into?

During Collection creation, NDA staff determine the appropriate Permission Group based on the type of data to be submitted, the type of access that will be available to data access users, and the information provided by the Program Officer during grant award.
How do I know when an NDA Collection has been created?

When a Collection is created by NDA staff, an email notification will automatically be sent to the PI(s) of the grant(s) associated with the Collection to notify them.
Is a single grant number ever associated with more than one Collection?

The NDA system does not allow a single grant to be associated with more than one Collection; therefore, a given grant appears in the Grant Information section of only one Collection.
Why is there sometimes more than one grant included in a Collection?

In general, each Collection is associated with only one grant; however, multiple grants may be associated if the grant has multiple competing segments for the same grant number or if multiple different grants are all working on the same project and it makes sense to hold the data in one Collection (e.g., Cooperative Agreements).

Glossary

Administrator Privilege

A privilege provided to a user associated with an NDA Collection or NDA Study whereby that user can perform a full range of actions including providing privileges to other users.
Collection Owner

Generally, the Collection Owner is the contact PI listed on a grant. Only one NDA user is listed as the Collection Owner. Most automated emails are sent primarily to the Collection Owner.
Collection Phase
The Collection Phase provides information on data submission as opposed to grant/project completion, so while the Collection Phase and grant/project phase may be closely related, they are often different. Collection users with Administrative Privileges are encouraged to edit the Collection Phase. The Program Officer as listed in eRA (for NIH funded grants) may also edit this field. Changes must be saved by clicking the Save button at the bottom of the page. This field is sortable alphabetically in ascending or descending order. Collection Phase options include:
- Pre-Enrollment: A grant/project has started, but has not yet enrolled subjects.
- Enrolling: A grant/project has begun enrolling subjects. Data submission is likely ongoing at this point.
- Data Analysis: A grant/project has completed enrolling subjects and has completed all data submissions.
- Funding Completed: A grant/project has reached the project end date.
Collection Title

An editable field with the title of the Collection, which is often the title of the grant associated with the Collection.
Grant

Provides the grant number(s) for the grant(s) associated with the Collection. The field is a hyperlink so clicking on the Grant number will direct the user to the grant information in the NIH Research Portfolio Online Reporting Tools (RePORT) page.
Supporting Documentation

Various documents and materials that enable efficient use of the data by investigators unfamiliar with the project. These may include the research protocol, questionnaires, and study manuals.
NIH Research Initiative

NDA Collections may be organized by scientific similarity into NIH Research Initiatives, to facilitate query tool user experience. NIH Research Initiatives map to one or multiple Funding Opportunity Announcements.
Permission Group

Access to shared record-level data in NDA is provisioned at the level of a Permission Group. NDA Permission Groups consist of one or multiple NDA Collections that contain data with the same subject consents.
Planned Enrollment

Number of human subject participants to be enrolled in an NIH-funded clinical research study. The data is provided in competing applications and annual progress reports.
Actual Enrollment

Number of human subjects enrolled in an NIH-funded clinical research study. The data is provided in annual progress reports.
NDA Collection

A virtual container and organization structure for data and associated documentation from one grant or one large project/consortium. It contains tools for tracking data submission and allows investigators to define a wide array of other elements that provide context for the data, including all general information regarding the data and source project, experimental parameters used to collect any event-based data contained in the Collection, methods, and other supporting documentation. It also allows investigators to link underlying data to an NDA Study, defining populations and subpopulations specific to research aims.
Data Use Limitations

Data Use Limitations (DULs) describe the appropriate secondary use of a dataset and are based on the original informed consent of a research participant. NDA only accepts consent-based data use limitations defined by the NIH Office of Science Policy.
Total Subjects Shared

The total number of unique subjects for whom data have been shared and are available for users with permission to access data.

Contact NDA Help Desk

ID	Name	Created Date	Status	Type
No records found.

Collection - Experiments

The number of Experiments included is displayed in parentheses next to the tab name. You may download all experiments associated with the Collection via the Download button. You may view individual experiments by clicking the Experiment Name and add them to the Filter Cart via the Add to Cart button.

Collection Owners, Program Officers, and users with Submission or Administrative Privileges for the Collection may create or edit an Experiment.

Please note: The creation of an NDA Experiment does not necessarily mean that data collected according to the defined Experiment have been submitted or shared.

Frequently Asked Questions

Can an Experiment be associated with more than one Collection?
Yes. See the “Copy” button in the bottom left when viewing an experiment. Two actions can be performed via this button:
1. Copy the experiment with the intent to modify it.
2. Associate the experiment with the collection. No modifications can be made to the experiment.

Glossary

Experiment Status

An Experiment must be Approved before data using the associated Experiment_ID may be uploaded.
Experiment ID

The ID number automatically generated by NDA which must be included in the appropriate file when uploading data to link the Experiment Definition to the subject record.

Shared Data:

Title	Type	Number of Subjects
Altman Self-Rating Mania Scale	Clinical Assessments	665
Clinical Trials. Demographics	Clinical Assessments	665
Clinical Trials: Randomization	Clinical Assessments	665
Cognitive and Physical Functioning Scale	Clinical Assessments	665
Concise Associated Symptoms Tracking Scale	Clinical Assessments	665
Concise Health Risk Tracking	Clinical Assessments	665
Concomitant Medications	Clinical Assessments	665
Early Termination Form	Clinical Assessments	250
Eligibility Form	Clinical Assessments	731
Frequency Intensity Burden Side Effects	Clinical Assessments	633
Hamilton Rating Scale for Depression	Clinical Assessments	665
History of Neglect/Abuse	Clinical Assessments	665
Inventory of Depressive Symptomatology	Clinical Assessments	665
Medication History	Clinical Assessments	665
Medications Dispensing Form	Clinical Assessments	663
Menopausal Form	Clinical Assessments	665
Mini International Neuropsychiatric Interview. Part I	Clinical Assessments	665
Participant Adherence Questionnaire	Clinical Assessments	632
Pregnancy Outcome Form	Clinical Assessments	3
Protocol Violators	Clinical Assessments	396
Psychiatric Diagnostic Screening Questionnaire	Clinical Assessments	665
Quality of Life	Clinical Assessments	665
Quick Inventory of Depressive Symptomatology	Clinical Assessments	665
Research Subject	Clinical Assessments	482
Self Administered Comorbidity Questionairre	Clinical Assessments	665
Serious Adverse Events	Clinical Assessments	53
Suicide Questionnaire	Clinical Assessments	643
Systematic Assessment for Treatment Emergent Effects	Clinical Assessments	665
Vital Signs	Clinical Assessments	665
Work Productivity and Activity Impairment	Clinical Assessments	665
Work and Social Adjustment Scale Depression	Clinical Assessments	665

Collection - Shared Data

This tab provides a quick overview of the Data Structure title, Data Type, and Number of Subjects that are currently Shared for the Collection. The information presented in this tab is automatically generated by NDA and cannot be edited. If no information is visible on this tab, this would indicate the Collection does not have shared data or the data is private.

The shared data is available to other researchers who have permission to access data in the Collection's designated Permission Group(s). Use the Download button to get all shared data from the Collection to the Filter Cart.

Frequently Asked Questions

How will I know if another researcher uses data that I shared through the NIMH Data Archive (NDA)?

To see which data submitted by your project are being used by a study, go to the Associated Studies tab of your Collection. Alternatively, you may review an NDA Study Attribution Report available on the General tab.
Can I get a supplement to share data from a completed research project?

Often it becomes more difficult to organize and format data electronically after the project has been completed and the information needed to create a GUID may not be available; however, you may still contact a program staff member at the appropriate funding institution for more information.
Can I get a supplement to share data from a research project that is still ongoing?

Unlike completed projects where researchers may not have the information needed to create a GUID and/or where the effort needed to organize and format data becomes prohibitive, ongoing projects have more of an opportunity to overcome these challenges. Please contact a program staff member at the appropriate funding institution for more information.

Glossary

Data Structure

A defined organization and group of Data Elements to represent an electronic definition of a measure, assessment, questionnaire, or collection of data points. Data structures that have been defined in the NDA Data Dictionary are available at https://nda.nih.gov/general-query.html?q=query=data-structure
Data Type

A grouping of data by similar characteristics such as Clinical Assessments, Omics, or Neurosignal data.
Shared

The term 'Shared' generally means available to others; however, there are some slightly different meanings based on what is Shared. A Shared NDA Study is viewable and searchable publicly regardless of the user's role or whether the user has an NDA account. A Shared NDA Study does not necessarily mean that data used in the NDA Study have been shared, as this is independently determined. Data are shared according to the schedule defined in a Collection's Data Expected Tab and/or in accordance with data sharing expectations in the NDA Data Sharing Terms and Conditions. Additionally, Supporting Documentation uploaded to a Collection may be shared independent of whether data are shared.

Collection Owners and those with Collection Administrator permission may edit a collection. The following is currently available for edit on this page:

Publications

Publications relevant to NDA data are listed below. Most displayed publications have been associated with the grant within PubMed. Use the "+ New Publication" button to add new publications. Publications are categorized as relevant or not relevant to the data expected. Relevant publications are then linked to the underlying data by selecting the Create Study link. A Study provides the ability to define cohorts, assign subjects, and define outcome measures, and it lists the study type, data analysis, and results. Analyzed data and results are expected in this way.

PubMed ID	Study	Title	Journal	Authors	Date	Status
No records found.

Collection - Publications

The number of Publications is displayed in parentheses next to the tab name. Clicking on any of the Publication Titles will open the Publication in a new browser tab.

Collection Owners, Program Officers, and users with Submission or Administrative Privileges for the Collection may mark a publication as either Relevant or Not Relevant in the Status column.

Frequently Asked Questions

How can I determine if a publication is relevant?

Publications are considered relevant to a collection when the data shared is directly related to the project or collection.
Where does the NDA get the publications?

Publications come from PubMed, an online library containing journals, articles, and medical research that is sponsored by NIH and the National Library of Medicine (NLM).

Glossary

Create Study

A link to the Create an NDA Study page that can be clicked to start creating an NDA Study with information such as the title, journal and authors automatically populated.
Not Determined Publication

Indicates that the publication has not yet been reviewed and/or marked as Relevant or Not Relevant so it has not been determined whether an NDA Study is expected.
Not Relevant Publication

A publication that is not based on data related to the aims of the grant/project associated with the Collection or not based on any data such as a review article and, therefore, an NDA Study is not expected to be created.
PubMed

PubMed provides citation information for biomedical and life sciences publications and is managed by the U.S. National Institutes of Health's National Library of Medicine.
PubMed ID

The PubMed ID is the unique ID number for the publication as recorded in the PubMed database.
Relevant Publication

A publication that is based on data related to the aims of the grant/project associated with the Collection and, therefore, an NDA Study is expected to be created.

Associated Studies

Studies that have been defined using data from a Collection are important criteria for determining the value of the data shared. The Collection/Study Subjects column displays the number of subjects from this Collection that are included in a Study, out of the total number of subjects in that Study. The Data Usage column indicates whether the Study is a primary or a secondary analysis of the data. State indicates whether the Study is private or shared with the research community.
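
A value in the subjects column, such as 750/4800, can be read as a pair of counts and turned into a share of the study. The sketch below is illustrative only: `SubjectShare` and `parse_subject_share` are hypothetical names, and the "collection subjects / study subjects" layout is taken from the column description above.

```python
from typing import NamedTuple


class SubjectShare(NamedTuple):
    collection_subjects: int  # subjects contributed by this Collection
    study_subjects: int       # total subjects in the Study

    @property
    def fraction(self) -> float:
        return self.collection_subjects / self.study_subjects


def parse_subject_share(cell: str) -> SubjectShare:
    """Parse a 'collection/study' cell such as '750/4800'."""
    left, sep, right = cell.strip().partition("/")
    if not sep:
        raise ValueError(f"expected 'collection/study', got {cell!r}")
    share = SubjectShare(int(left), int(right))
    if share.study_subjects <= 0 or not 0 <= share.collection_subjects <= share.study_subjects:
        raise ValueError(f"inconsistent subject counts: {cell!r}")
    return share


share = parse_subject_share("750/4800")
print(f"{share.collection_subjects} of {share.study_subjects} subjects ({share.fraction:.1%})")
```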

Study Name	DOI	Abstract	Collection/Study Subjects	Data Usage	State
Towards Outcome-Driven Patient Subgroups: A Machine Learning Analysis Across Six Depression Treatment Studies	10.15154/1528714	Importance: Major depressive disorder (MDD) is a heterogeneous condition; multiple underlying neurobiological substrates could be associated with treatment response variability. Understanding the sources of this variability and predicting outcomes has been elusive. Machine learning (ML) has shown promise in predicting treatment response in MDD, but one limitation has been the lack of clinical interpretability of machine learning models, limiting clinician confidence in model results. Objective: To develop a machine learning model to derive treatment-relevant patient profiles using clinical and demographic information. Design: We analyzed data from six clinical trials of pharmacological treatment for depression (total n = 5438) using the Differential Prototypes Neural Network (DPNN), a neural network model that derives patient prototypes which can be used to derive treatment-relevant patient clusters while learning to generate probabilities for differential treatment response. A model classifying remission and outputting individual remission probabilities for five first-line monotherapies and three combination treatments was trained using clinical and demographic data. Setting: Previously-conducted clinical trials of antidepressant medications. Participants: Patients with MDD. Main outcomes and measures: Model validity and clinical utility were measured based on area under the curve (AUC) and expected improvement in sample remission rate with model-guided treatment, respectively. Post-hoc analyses yielded clusters (subgroups) based on patient prototypes learned during training. Prototypes were evaluated for interpretability by assessing differences in feature distributions (e.g. age, sex, symptom severity) and treatment-specific outcomes. Results: A 3-prototype model achieved an AUC of 0.66 and an expected absolute improvement in population remission rate of 6.5% (relative improvement of 15.6%). We identified three treatment-relevant patient clusters. Cluster A patients tended to be younger, to have increased levels of fatigue and more severe symptoms. Cluster B patients tended to be older, female with less severe symptoms, and the highest remission rates. Cluster C patients had more severe symptoms, lower remission rates, more psychomotor agitation, more intense suicidal ideation, more somatic genital symptoms, and showed improved remission with venlafaxine. Conclusion and Relevance: It is possible to produce novel treatment-relevant patient profiles using machine learning models; doing so may improve precision medicine for depression. Note: This model is not currently the subject of any active clinical trials and is not intended for clinical use.	745/6074	Secondary Analysis	Shared
Treatment selection using prototyping in latent-space with application to depression treatment	10.15154/1523049	Machine-assisted treatment selection commonly follows one of two paradigms: a fully personalized paradigm which ignores any possible clustering of patients; or a sub-grouping paradigm which ignores personal differences within the identified groups. While both paradigms have shown promising results, each of them suffers from important limitations. In this article, we propose a novel deep learning-based treatment selection approach that is shown to strike a balance between the two paradigms using latent-space prototyping. Our approach is specifically tailored for domains in which effective prototypes and sub-groups of patients are assumed to exist, but groupings relevant to the training objective are not observable in the non-latent space. In an extensive evaluation, using both synthetic and Major Depressive Disorder (MDD) real-world clinical data describing 4754 MDD patients from clinical trials for depression treatment, we show that our approach favorably compares with state-of-the-art approaches. Specifically, the model produced an 8% absolute and 23% relative improvement over random treatment allocation. This is potentially clinically significant, given the large number of patients with MDD. Therefore, the model can bring about a much desired leap forward in the way depression is treated today.	749/5946	Secondary Analysis	Shared
Analysis of Features Selected by a Deep Learning Model for Differential Treatment Selection in Depression	10.15154/1522874	Background: Deep learning has utility in predicting differential antidepressant treatment response among patients with major depressive disorder, yet there remains a paucity of research describing how to interpret deep learning models in a clinically or etiologically meaningful way. In this paper, we describe methods for analyzing deep learning models of clinical and demographic psychiatric data, using our recent work on a deep learning model of STAR*D and CO-MED remission prediction. Methods: Our deep learning analysis with STAR*D and CO-MED yielded four models that predicted response to the four treatments used across the two datasets. Here, we use classical statistics and simple data representations to improve interpretability of the features output by our deep learning model and provide finer grained understanding of their clinical and etiological significance. Specifically, we use representations derived from our model to yield features predicting both treatment non-response and differential treatment response to four standard antidepressants, and use linear regression and t-tests to address questions about the contribution of trauma, education, and somatic symptoms to our models. Results: Traditional statistics were able to probe the input features of our deep learning models, reproducing results from previous research, while providing novel insights into depression causes and treatments. We found that specific features were predictive of treatment response, and were able to break these down by treatment and non-response categories; that specific trauma indices were differentially predictive of baseline depression severity; that somatic symptoms were significantly different between males and females, and that education and low income proved important psycho-social stressors associated with depression. Conclusion: Traditional statistics can augment interpretation of deep learning models. Such interpretation can lend us new hypotheses about depression and contribute to building causal models of etiology and prognosis. We discuss dataset-specific effects and ideal clinical samples for machine learning analysis aimed at improving tools to assist in optimizing treatment.	750/4800	Secondary Analysis	Shared
Differential Treatment Benefit Prediction for Treatment Selection in Depression: A Deep Learning Analysis of STAR*D and CO-MED Data	10.15154/1522873	Depression affects one in nine people, but treatment response rates remain low. There is significant potential in the use of computational modeling techniques to predict individual patient responses and thus provide more personalized treatment. Deep learning is a promising computational technique that can be used for differential treatment selection based on predicted remission probability. Using Sequenced Treatment Alternatives to Relieve Depression (STAR*D) and Combining Medications to Enhance Depression Outcomes (CO-MED) trial data, we employed deep neural networks to predict remission after feature selection. Treatments included were citalopram, escitalopram, bupropion SR plus escitalopram, and venlafaxine plus mirtazapine. Differential treatment benefit was estimated in terms of improvement of population remission rates after application of the model for treatment selection using two approaches: (1) using predictions generated directly from the model (the predicted improvement approach) and (2) using bootstrapping for sample generation and then estimating population remission rate for patients who actually received the drug predicted by the model compared to the general population (the actual improvement approach). Our deep learning model predicted remission in a pooled CO-MED/STAR*D dataset (including four treatments) with an area under the curve of 0.69 using 17 input features. Our actual improvement analysis showed a statistically significant 2.48% absolute improvement (corresponding to a 7.2% relative improvement) in population remission rate (p = 0.01, CI 2.48% ± 0.5%). Our model serves as proof-of-concept that deep learning approaches, with further refinement and work to address concerns about differences between studies when multiple datasets are used for training, may have utility in differential prediction of antidepressant response when selecting from a number of treatment options.	750/4800	Secondary Analysis	Shared
Summary Measures for Quantifying the Extent of Visit Irregularity in Longitudinal Data: The STAR*D Study	10.15154/1518466	This chapter applies the measures of irregularity from this thesis to the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study. The STAR*D study is the largest randomized clinical trial on patients suffering from major depression. This chapter focuses on the first phase of the study which pre-specified a common set of scheduled measurement occasions at weeks 2, 4, 6, 9, 12 post-baseline where individuals had their Quick Inventory of Depression Symptomatology (QIDS) questionnaire score recorded; however there were individuals who missed scheduled visits, and had unscheduled visits. Therefore, interest lies in determining whether visits can be treated as repeated measures. This is followed by a demonstration on how to select the appropriate modelling approach for the study outcome, and how to interpret the resulting parameter estimates. The target of inference of this chapter is to evaluate the mean QIDS score over the first 12 weeks of the trial.	76/4036	Secondary Analysis	Shared
Studying treatment-effect heterogeneity in precision medicine through induced subgroups	10.15154/1503440	Precision medicine, in the sense of tailoring the choice of medical treatment to patients’ pretreatment characteristics, is nowadays gaining a lot of attention. Preferably, this tailoring should be realized in an evidence-based way, with key evidence in this regard pertaining to subgroups of patients that respond differentially to treatment (i.e., to subgroups involved in treatment–subgroup interactions). Often a-priori hypotheses on subgroups involved in treatment–subgroup interactions are lacking or are incomplete at best. Therefore, methods are needed that can induce such subgroups from empirical data on treatment effectiveness in a post hoc manner. Recently, quite a few such methods have been developed. So far, however, there is little empirical experience in their usage. This may be problematic for medical statisticians and statistically minded medical researchers, as many (nontrivial) choices have to be made during the data-analytic process. The main purpose of this paper is to discuss the major concepts and considerations when using these methods. This discussion will be based on a systematic, conceptual, and technical analysis of the type of research questions at play, and of the type of data that the methods can handle along with the available software, and a review of available empirical evidence. We will illustrate all this with the analysis of a dataset comparing several anti-depressant treatments.	665/665	Secondary Analysis	Shared
Consistent differential effects of bupropion and mirtazapine in major depression	10.15154/qzg7-n302	Background: Patients with major depression exhibit heterogeneous symptom profiles and variable responses to antidepressants. Most clinical trials rely on aggregate outcomes such as total symptom severity or remission rates, which often obscure meaningful differences in treatment response. Methods: We applied the Supervised Varimax (SV) algorithm to identify outcome dimensions that maximally differentiate antidepressants based on symptom-level effects. We analyzed all relevant levels of the STAR*D trial and validated findings in the independent CO-MED study. We assessed statistical significance using permutation testing with familywise error rate (FWER) correction. Results: SV consistently identified interpretable and statistically significant differences between bupropion, mirtazapine, and other antidepressants. In STAR*D, bupropion monotherapy produced greater improvement in hypersomnia than venlafaxine in Levels 2 and 2A (n=686, difference = 0.384, p_{FWER}=0.007). Bupropion augmentation outperformed buspirone augmentation for increased weight, increased appetite, and fatigue in Level 2 (n=520, difference = -0.322, p_{FWER}=0.005). Mirtazapine monotherapy outperformed nortriptyline for insomnia, decreased weight, and decreased appetite in Level 3 (n=214, difference = 0.401, p_{FWER}=0.022), and venlafaxine with mirtazapine similarly outperformed tranylcypromine in Level 4 (n=102, difference = -0.722, p_{FWER}=0.004). In CO-MED, escitalopram with bupropion and venlafaxine with mirtazapine demonstrated complementary symptom-specific benefits (n=640, difference = -0.302, p_{FWER}=0.022). Conclusion: Bupropion is most effective for hypersomnia, increased weight, increased appetite, or fatigue, while mirtazapine is preferable for insomnia, decreased weight, or decreased appetite. SV enables statistically rigorous, symptom-level differentiation using only treatment assignment, offering a scalable and clinically aligned framework for guiding antidepressant selection from individual clinical trials.	78/82	Secondary Analysis	Shared

* Data not on individual level

Collection - Associated Studies

Clicking the Study Title opens the study details in a new browser tab. The Abstract gives the background of the study, as provided by the Collection Owner.

Primary v. Secondary Analysis: The Data Usage column shows one of these two values. Primary Analysis indicates that at least some, and potentially all, of the data used was originally collected by the creator of the NDA Study. Secondary Analysis indicates that the Study owner was not involved in collecting the data, which may be used as supporting data.

Private v. Shared State: A private study is available only to users who can access the collection. A shared study is accessible to the general public.

Frequently Asked Questions

How do I associate a study to my collection?

Studies are associated with the Collection automatically when data from the Collection is defined in the Study.

Glossary

Associated Studies Tab

A tab in a Collection that lists the NDA Studies created using data from that Collection, including both Primary and Secondary Analysis NDA Studies.
