Stimuli

Stimulus Files Organization

Stimulus files MUST be stored in the /stimuli directory under the root directory of the dataset. The /stimuli directory can contain subdirectories to organize the stimulus files. Stimulus files MUST follow the BIDS naming conventions and are referenced in the events.tsv file using the stim_id column.

The standardization of stimulus files and their annotations within BIDS offers several key benefits:

Consistency: Ensures uniform storage and referencing across datasets
Reusability: Enables stimulus reuse across studies through standardized structure
Efficiency: Minimizes redundancy by centralizing annotations
Flexibility: Facilitates dataset reuse with alternative annotations

To preserve backward compatibility with existing datasets (see the Legacy section below), the use of these specifications for the /stimuli directory and the stim_id column in the events.tsv files is RECOMMENDED but not required. Researchers are encouraged to follow these guidelines to enhance the interoperability and reproducibility of their studies.

Following these guidelines will help ensure that stimulus files and their annotations are stored and referenced consistently across different datasets, facilitating data sharing, reuse, and reproducibility.

File Organization

└─ stimuli/
   ├─ stimuli.tsv 
   ├─ stimuli.json 
   ├─ [stim-<label>[_part-<label>]_<suffix>.<extension>] 
   ├─ [stim-<label>[_part-<label>]_<suffix>.json] 
   ├─ [[stim-<label>_]annotations.tsv] 
   ├─ [[stim-<label>_]annotations.json] 
   ├─ [stim-<label>[_part-<label>]_annot-<label>_events.tsv] 
   └─ [stim-<label>[_part-<label>]_annot-<label>_events.json]

Note: The presence of the stimuli.tsv file indicates that the content of the /stimuli directory follows this BIDS specification for stimulus organization.

Stimulus File Formats

The following table lists the supported stimulus file formats and their corresponding suffixes. The suffixes are used to identify the type of stimulus file and are appended to the stim-<label> prefix in the file name.

suffix	extensions	description
audio	`.wav`, `.mp3`, `.aac`, `.ogg`	Audio-only stimulus files
image	`.jpg`, `.png`, `.svg`, `.webp`	Static visual stimulus files
video	`.mp4`, `.avi`, `.mkv`, `.webm`	Video-only stimulus files
audiovideo	`.mp4`, `.avi`, `.mkv`, `.webm`	Combined audio-visual files

Stimulus description (`stim-<label>_<suffix>.json`)

The stim-<label>_<suffix>.json file provides metadata about the singular stimulus file. The following fields are defined to describe the stimulus file:

Key name	Requirement Level	Data type	Description
License	RECOMMENDED	string	License under which this stimulus is shared.
Copyright	RECOMMENDED	string	Copyright information
URL	OPTIONAL	string	Location (origin) for the stimulus file.
Description	OPTIONAL	string	Free-form natural language description.

In some cases, such as observing the copyright of a stimulus file, the actual stimulus file may not be shared. In such cases, the stim-<label>_<suffix>.json file SHOULD be used to provide metadata about the stimulus file, including the license, copyright, URL, and description.

Example `stim-<label>_<suffix>.json`

{
    "License": "CC-BY-4.0",
    "Copyright": "2023 Lab Name lab@university.edu",
    "URL": "https://example.com/stimuli/",
    "Description": "Collection of face images, tones, and movie clips used in the experiment"
}

The License field SHOULD provide the known identifiers, such as PDL, CC0, CC-BY from the BIDS Licensees Appendix, or common license lists such as SPDX or Creative Commons. The Copyright filed SHOULD provide the year, copyright holder's name, and if available, the email address of the copyright holder. If the stimulus file is not shared, the URL field SHOULD provide a link to the stimulus file.

Stimuli Description (`stimuli.tsv`)

The stimuli.tsv files are used to provide information about the stimuli based on their stim_id. This file is similar in usage as participants.tsv, scans.tsv and sessions.tsv, which list descriptions about subjects, scans and sessions, respectively. The stimuli.tsv files MUST be placed in the /stimuli directory.

The stimuli.tsv file contains information about each stimulus, including stimulus ID, type, URL, and other relevant details. The following table describes the REQUIRED, RECOMMENDED, and OPTIONAL columns for the stimuli.tsv file:

Column name	Requirement Level	Data type	Description
stimulus_id	REQUIRED	string	Unique identifier for a specific stimulus. This column must appear first in the file.
type	REQUIRED	string	Type of stimulus (for example, image, audio, video). Refers to the suffix of the stimulus file if present. This column may appear anywhere in the file.
URL	OPTIONAL	string	Uniform Resource Locator pointing to the stimulus file, particularly useful if the stimulus file is not distributed with the dataset. This column may appear anywhere in the file.
license	RECOMMENDED	string	License under which the stimulus file is distributed. SHOULD use identifiers from recognized lists like SPDX or Creative Commons. This column may appear anywhere in the file.
copyright	RECOMMENDED	string	Copyright information for the stimulus file, typically including year and holder. This column may appear anywhere in the file.
description	RECOMMENDED	string	Free-form text description of the stimulus identified by `stimulus_id`. This column may appear anywhere in the file.
HED	OPTIONAL	string	Hierarchical Event Descriptor (HED) Tag. See the HED Appendix for details. This column may appear anywhere in the file.
filename	OPTIONAL	string	Relative paths to files. This column may appear anywhere in the file.
present	OPTIONAL	boolean	Indicates whether the stimulus file corresponding to the `stimulus_id` is physically present in the dataset. This column may appear anywhere in the file. Must be one of: `"true"`, `"false"`.
partDescription	OPTIONAL	string	Free-form text description of a specific part of a stimulus, relevant when the stimulus is split into multiple files using the `part` entity. This column may appear anywhere in the file.
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed if they are defined in the associated metadata file.

Example `stimuli.tsv`

stimulus_id    type     URL                                     license      copyright    description                                  present
stim-face01   image    https://example.com/faces/face01.jpg    CC-BY-4.0   Lab 2023    A female face with neutral expression       true
stim-tone01   audio    https://example.com/tones/tone01.wav    CC-BY-4.0   Lab 2023    A 440Hz pure tone                          true
stim-movie01  video    https://example.com/movies/movie01.mp4  n/a         Studio XYZ  A clip from copyrighted movie              false

The stimuli.json file provides detailed descriptions of the columns in the stimuli.tsv file. There MAY be extra entries in the stimuli.json in addition to the columns in the stimuli.tsv to provide more details about the stimulus.

In cases where the stimulus is not shared, the stimuli.tsv file can be used to provide metadata about the stimuli, including the license, copyright, URL, and description. This is similar to the use of stim-<label>_<suffix>.json files for individual stimuli files. In the case of conflict between the metadata in the stimuli.tsv and stim-<label>_<suffix>.json files, the metadata in the stim-<label>_<suffix>.json file takes precedence.

Stimulus Annotations

Annotations of the still images or general description of the stimuli (such as frequency and duration of a beep sound) can be stored in the stimuli.tsv as an additional column or stim-<label>_<suffix>.json as described above. Here is an example of how annotations can be stored in the stimuli.tsv file for an image from the Natural Scene Dataset (NSD):

stimulus_id	type	description	HED	NSD_id	COCO_id
stim-nsd02951	image	an open market full of people and piles of vegetables	((Item-count, High), Ingestible-object), (Background-view, ((Human, Body, Agent-trait/Adult), Outdoors, Furnishing, Natural-feature/Sky, Urban, Man-made-object))	2951	262145

However, for time-varying stimuli, such as audio or video, it is RECOMMENDED to use specific annotations files in the form of stim-<label>_annot-<label>_events.tsv to store the annotations. These files have the same structure as the events.tsv files and are used to store annotations for the stimuli. There can be multiple annotation files for a single stimulus file, each with a unique annotation label. The annotation files MUST be stored in the /stimuli directory.

Annotation Description (`annotations.tsv`)

The annotations.tsv file contains additional metadata about stimulus annotations. There MAY be a single annotations.tsv file for all the stimuli or separate stim-<label>_annotations.tsv files for each stimulus. The following columns are defined for the annotations.tsv file:

Column name	Requirement Level	Data type	Description
annot_id	REQUIRED	string	Unique identifier for a specific annotation set applying to a stimulus or group of stimuli. Links to annotation files (for example, `_annot-<label>_events.tsv`). This column must appear first* in the file.
description	REQUIRED	string	Free-form text description of the annotation identified by `annot_id`. This column may appear anywhere in the file.
Additional Columns	OPTIONAL	`n/a`	Additional columns are allowed if they are defined in the associated metadata file.

Example `*_annotations.tsv`

annot_id     description
face01_emo   Emotion annotation for face01 stimulus
face01_gen   Gender annotation for face01 stimulus
face01_age   Age group annotation for face01 stimulus

Referencing Stimulus Identifiers in `events.tsv`

To reference stimulus identifiers in the events.tsv file, use the stim_id column. The values in the stim_id column should represent unique identifiers for the stimuli. Stimulus ID (stim_id) should correspond to the unique identifier of the stimulus file in the /stimuli directory and expand to all files (both stimulus and annotation files) that share the same stimulus ID.

Example events.tsv file:

onset	duration	trial_type	response_time	stim_id
1.23	0.65	start	1.435	`stim-<label>`
5.65	0.65	stop	1.739	`stim-<label>`
12.1	2.35	n/a	n/a	`stim-<label>`

In the accompanying JSON sidecar, the stim_id column might be described as follows:

{
    "stim_id": {
        "LongName": "Stimulus identifier",
        "Description": "Represents a unique identifier for the stimulus presented at the given onset time."
    }
}