Many of the available EEG and MEG systems come with analysis software packages with varying levels of detailed descriptions of how the different preprocessing tools are implemented. In addition, several freely available software packages that run on MATLAB/Python/R platforms, or commercial data analysis packages offer alternative implementations of data analysis tools. In addition, custom-written software can be used. The software that has been used for the preprocessing and subsequent analysis must be indicated (including the version). In-house software should be described in explicit detail (or peer-reviewed references or pre-prints with such details cited). The source code should be publicly released and access links should be provided (e.g., GitHub or another readily accessible internet-based location).
Preprocessing is a crucial step in MEEG signal analysis as data are typically distorted due to various factors. The sequence of analysis steps of the preprocessing pipeline and their order influences the data to be used for subsequent analysis. The workflow, therefore, has to be described step-by-step and with such a level of detail that it could be exactly reproduced by another researcher. For most studies, recommended steps after general visual data inspection include: 1) identification and removal of electrodes/sensors with poor signal quality. It is essential to clearly describe the methodology and the criteria used, particularly if interpolation is used. 2) artifact identification and removal. Similarly state the method and criteria used to identify artefact, and the method used in case correction is performed; 3) detrending (when and if appropriate); 4) digital low- and high-pass digital filtering with filter-type characteristics (IIR/FIR, type of filter [e.g., Butterworth, Chebyshev etc], cut-off frequency, roll-off/order etc.); 5) data segmentation (if performed); 6) additional identification/elimination of artifacts (blinks, cardiac activity etc); 7) baseline correction (when and if appropriate); 8) re-referencing for EEG (e.g., earlobe/mastoid-reference, common-average reference, bipolar, surface Laplacian; when and if desired).
The steps and sequence described above are appropriate for most basic analyses of data. That said, for specifics of analyses, or due to specific data characteristics, the order of processing may vary for scientific reasons. For example, data segmentation could occur at different points in the pipeline, depending in part on the specific artifact removal methods used. Data re-referencing could theoretically be performed at various points in the pipeline also, but it is important to note that re-referencing can spread artifacts. The committee recognizes that investigators require a pipeline where the order of steps is taken for specific reasons, and hence we are not prescriptive about a particular order of data analysis. That said, for each study, the order of the steps in the preprocessing pipeline should be made very clear, so that other investigators can replicate the study.
Visual inspection of the spatiotemporal structure in the signals after each step is recommended and, if needed, remaining epochs with poor data quality should be marked and excluded from further analysis. When such epochs are additionally rejected, a record should be provided such that the same analysis could be reproduced from the raw data (e.g., after preprocessing epochs 10, 25, 45, 60 were additionally excluded showing residual artifacts). During preprocessing, topographic maps of the distribution of the means and variances of scalp voltages (for EEG) and magnetic fields exiting and entering the head (for MEG) might serve as an additional tool for spotting channels with poor data quality that might escape detection in waveform displays (Michel et al., 2009 Electrical Neuroimaging).
Artifacts and filtering
Artifacts from many different sources can contaminate MEEG data and must be identified and/or removed. Artifacts can be of non-physiological (bad electrode contact, power line, etc.) or physiological (pulse, muscle activity, sweating, movement, etc.) origin. The data should first be visually inspected (where the investigator is blind to experimental conditions) to ascertain what types of artifact are actually present in the data. Subsequently, established artifact identification/removal pipelines can be run, or a plan can be constructed for data segments with artifacts to be excluded.
If automatic artifact detection methods are used, they should be followed up by visual inspection of the data. Any operations performed on the data (see Section 4.1 workflow) must be described, specifying the parameters of the algorithm used. It is recommended to describe in detail the type of detrending performed and the algorithm order (e.g., linear 1st order, piecewise, etc). When automatic artifact rejection/correction is performed, which method was used and what was the range of parameters (e.g., EEG data with a range bigger than 75 microV, epoch rejected based on 3 SD deviation from the mean kurtosis). Similarly for channel interpolation, one must specify the method and order (e.g., trilinear, spline nth order). When ICA is used, describe the algorithm and parameters used, including the number of ICs. If artefacts are rejected using ICA, report how these were identified. It is worthwhile to also consider including topographies of components in the Supplementary Materials section of manuscripts. If manual artifact rejection procedures are used, it is essential to describe what types of features in the MEEG signal were identified and define the criteria used to reject segments of data. This is also essential for allowing the reader to reproduce the results, as well as compare results between studies (see above on reporting visually removed trials (epochs) for instance). Once artefacts have been removed, the average number of remaining trials per condition should also be reported.
In MEG specialized procedures for correcting data that contain artifacts can be used, eliminating the need to reject data. These include techniques such as signal-space projection methods (SSP, Uusiltalo & Ilmoniemi, 1997)for that use “empty room” measurements with MEG data to differentiate external sources of interference from brain activity, and signal space separation (SSS) methods and their temporally extended variants (tSSS, Taulu et al., 2004; Taulu & Simola, 2006) that rely on the geometric separation of brain activity from noise signals in MEG data. SSS methods have been recommended as being superior to SSP (Haumann et al., 2016). The ordering of preprocessing steps for cleaning MEG data is particularly important, due to potential data transformation – for some caveats see Gross et al., 2013.
For both MEG and EEG data, particular attention must be taken to describe filtering, as this can have dramatic consequences on estimating time-courses and phases (Rousselet, 2012; Widmann & Schröger, 2012). Some investigators have advocated the use of a sampling rate that is 4 times above the putative cut-off frequency of the low pass filter used (Luck et al 2014 and latest IFCN guidelines). That said, the roll-off rate/slope of the filter must also be taken into consideration and therefore specified with a description of the type, bands etc. of the filters used. The type and parameters of any applied post-hoc filter and (for EEG, EOG and EMG) re-computed references must be specified as they crucially affect the outputs of waveform or frequency analyses (but not topographies). For frequency and time-frequency analyses, details on the transformation algorithm must similarly be reported.
Data preprocessing also forms an essential part of multivariate techniques, and can dramatically affect decoding performance (Guggenmos et al., 2018). We recommend to carefully describe the method used, in particular, if noise normalization is performed channel wise (univariate normalization) or for all channels together (multivariate normalization, or whitening). For the latter, the covariance characterization must be specified (based on baseline, epochs, or for each time point).
EEG is a differential measure and in non-clinical EEG is usually recorded relative to a fixed reference (in contrast to clinical practice, which usually uses bipolar montages). While EEG is recorded relative to some reference, it can later be re-referenced by subtracting the values of another channel or weighted sum of channels from all channels. The need for re-referencing depends on the goals of the analysis and EEG measures used (e.g., the average reference, see below) is beneficial for evaluation of coherence and for source modelling, although it may produce an inflated level of synchrony or leakage of activity from one region to another when assessed in sensor space). Re-referencing does not change the shape of the scalp topography at a single time sample, however, this can cause issues when working in sensor space with isolated sensors (Hari & Puce, 2017). Specifically, the shape of the recorded waveforms at specific electrodes can be altered and this will also affect the degree of distortion of waveforms by artifacts. Hence, when comparing across experiments, the references used should be taken into account, and if unusual, the reference choice should be justified.
Re-referencing relative to the average of all channels (common average reference, CAR) is most common for high-density recordings as the first step in EEG source localization. The main assumption behind the AR is that the summed potentials from electrodes spaced evenly across the entire head should be zero (Bertrand et al., 1985). While this is usually not a problem for EEG datasets of 128 sensors or more (Srinivasan et al., 1998; Nunez & Srinivasan, 2006), for low density recordings and ROI-based analyses in sensor space there is a serious risk of violating the assumptions for the average reference and the possibility of introducing shifts in potentials (Hari & Puce, 2017). Hence, the AR should be avoided in low-density recordings (e.g., < 64 channels).
An alternative to the AR is the Reference Electrode Standardization Technique (or REST) (Yao, Dezhong, 2001). Both the AR and REST have been shown to be the extremes of a family of Bayesian reference estimators (Hu et al. 2018). If the focus of the data analysis will be on source space inference no re-referencing is theoretically necessary. That said, re-referencing of the data may be useful for comparisons to existing literature.
For MEG data, the order (e.g., 3rd order with CTF data) of gradiometer re-referencing should be reported (if applicable), as well as when in the preprocessing pipeline this step occurs.
Spectral and time-frequency analysis
A common approach for the analysis of MEEG data is to examine the data in terms of its frequency content, and these analyses are applicable for both task-related as well as resting state designs. One important caveat for these types of analyses is that the acquired data should have been sampled at around 5 times the oscillatory frequencies of interest, due to the analogue anti-aliasing filter characteristics – underscoring the importance of planning all data analyses prior to data acquisition, ideally during the design of the study.
In task-related designs MEEG activity can be classified as evoked (i.e., be phase-locked to task events/stimulus presentation) or induced (i.e., related to the event, but not exactly phase-locked to it). Hence, it is important to specify what type of activity is being studied. The domain in which the analysis proceeds (time and frequency or frequency alone) should be specified, as should the spectral decomposition method used (e.g., wavelets, spectral analyses), and whether the data are expressed in sensor or source space. These methods can be the precursor to the assessment of functional connectivity (see Section 4.6).
The spectral decomposition algorithm, as well as parameters used, should be specified in sufficient detail since these crucially affect the outcome. Therefore, depending on the decomposition method used (e.g., wavelet convolution, Fourier decomposition, Hilbert transformation of bandpass-filtered signals, or parametric spectral estimation), one should describe the type of wavelet (including the tuning parameters), the exact frequency or time-frequency parameters (frequency and time resolutions), exact frequency bands, number of data points, zero padding, windowing (e.g., a Hann or Hanning window), spectral smoothing need to be specified. It is relevant to note that the required frequency resolution is defined as the minimum frequency interval that two distinct underlying oscillatory components need to have in order to be dissociated in the analysis (Bloomfield, 2004; Boashash, 2003). This should not be mistaken with the increments at which the frequency values are reported (e.g., when smoothing or oversampling is used in the analyses). When using overlapping windows (e.g., in Welch’s method) or using Multi-taper windows for robust estimation, the potential spectral smoothing may lead to closely spaced narrow frequency bands to blend. This should be carefully considered and reported.
MEEG data are recorded from outside the head. Source modelling is an attempt to explain the spatio-temporal pattern of the recorded data in sensor space as resulting from the activity of specific neural sources within the brain (within source space), a process known as solving the inverse problem. Since there is no unique solution to the inverse problem, it is mathematically ill-posed and needs to be estimated using additional assumptions. Source modelling first requires a solution of the forward problem, which predicts the effect of the tissues in the head on the propagation of activity to MEEG sensors. These procedures require a volume conduction model of the head and a source model, both of which can crucially influence the accuracy and reliability of the results (Baillet et al., 2001; Michel & He, 2018). Practically, the forward model (or lead field matrix) describes the magnetic field or potential distributions in sensor space that result from a predefined set of (unit amplitude) sources. The sources are typically defined either in a volumetric grid or on a cortically constrained sheet. Information from the forward model is then used to estimate the solution of the inverse problem, in which the measured MEEG signals are attributed to active sources within the brain. It is important to note that source modelling procedures essentially provide approximations of the inverse solution as solved under very specific assumptions or constraints.
In addition to the MEEG data itself, two other types of data are needed to generate solutions to the forward and inverse problems. These are the spatial locations of the sensors and fiducials relative to the head (Section 3.2), and details of the type of geometric data that the MEEG data will be coregistered e.g., spherical head model or structural MRI. The procedure used to coregister the locations of measurement sensors and fiducials with geometric data must be described (see Section 2.1 for definitions; Section 3.2 for sensor digitization methods). If using structural MRI data, it should be made clear if a normalized structural MRI volume such as the MNI152 template, or individual participant MRIs are being used for data analyses. If high-resolution structural MRIs are being acquired in individual participants, all data acquisition parameters, as well as normalization procedures should be described.
It is essential that all details of the head model and the source model are given. For EEG, the numerical method (e.g., boundary element modelling (BEM), finite element modelling (FEM)) used to model the head by reconstruction from the structural MRI of the head must be reported, and the values of electrical conductivity of the different tissues that were used in the calculations must be specified. This is less of a problem for MEG where magnetic fields are not greatly distorted by passing through different tissue types (Baillet, 2017). For the volume conductor model, the type of distribution of the solution space and the distance between points in the space need to be reported, as well as the method used to extract the gray matter mantle, if the solution points are restricted to it. In this case, the head model should ideally be based on an individual MRI of the participant’s head, especially in clinical studies where brain lesions or malformations may be involved. In certain clinical settings, approximate head models might be adequate, although their limitations should be explicitly acknowledged (Valdés-Hernández et al. 2009). The source localization method (e.g., equivalent current dipole fitting, distributed model, dipole scanning), software and its version (e.g., BESA, Brainstorm, Fieldtrip, LORETA, MNE, Nutmeg, SPM, etc.) must be reported, with inclusion of parameters used (e.g., the regularization parameter) and appropriate reference to the technical paper describing the method in detail.
We refer here to connectivity analyses as any methods that aim to detect functional coupling between two or more channels or sources. It is important to report and justify the use of either sensor or source space for the calculation of derived metrics of coupling (e.g., network measures such as centrality or complexity). A recent general reference on connectivity measures can be found in O’Neill et al. (2018).
When using multivariate measures (either data-driven or model-based) such as partial coherence and multiple coherence, the exact variables that have been analysed and the exact variables with respect to which the data was partialised, marginalised, or conditioned should be determined. When reporting measures of data-driven spectral coherence or synchrony (Halliday et al. 1995) to follow should be considered and reported: the exact formulation (or reference), whether the measure has been debiased, any subtraction or normalisation with respect to an experimental condition or a mathematical criterion. In case of Auto-Regressive (AR)-based multivariate modelling (e.g., in the Partial Directed Coherence group of measures; Baccala and Sameshima (2001), the exact model parameters (number of variables, data points and window lengths, as well as the estimation methods and fitting criteria) should be reported.
While the committee agrees that statistical metrics of dependency can be obtained at the channel level, it should be clear that these are not per se measures of neural connectivity Haufe et al., 2012). The latter can only be obtained by an inferential process that compensates for volume conduction and spurious connections due to unobserved common sources or cascade effects. In spite of that, dependency measures can be useful for e.g., biomarking. The possible insight into brain function derived from these measures should be critically discussed. This is particularly important since the interpretation of MEEG-based connectivity metrics may be confounded by aspects of the data that do not directly reflect true neural events (Schoffelen & Gross J. 2009, Valdes-Sosa et al. 2011). Inference about connectivity between neural masses can only be performed with dependency measures at the source level and correct inferential procedures.
Table 3. Data pre-processing and processing checking-sheet
||– Indicate in detail the exact order in which preprocessing steps took place
||– Which software and version was/were used for preprocessing and processing, as well as the analysis platform
– In-house code, should be shared/made public
||– Indicate any downsampling of the data
– If electrodes/sensors were removed, which identification method was used, which ones were deleted, if missing channel interpolation is performed indicate which method
– Specify detrending method (typically polynomial order) for baseline correction
– Specify noise normalization method (typically used in multivariate analyses)
– If data segmentation is performed, indicate the number of epochs per subject per condition
– Indicate the spectral decomposition algorithm and parameters, and if applied before/after segmentation
|Detection/rejection/correction of artifacts
||– Indicate what types of artifact present in the data
– For automatic artifact detection, describe algorithms used and their respective parameters (e.g., amplitude thresholds)
– For manual detection, indicate the criteria used with as much detail as needed for reproducibility
– Indicate if trials with artifacts were rejected or corrected. If using correction, indicate method(s) and parameters
– If trials/segments of data with artifacts have been removed, indicate the average number of remaining trials per condition across participants (include minimum and maximum number of trials across participants)
– For resting state data, specify the length of time of the artifact-free data
|Correction of artifacts using BSS/ICA
||– Indicate how many total components were generated, what type of artifact was identified and how, and how many components were removed (on average across participants)
– Display example topographies of the ICs that were removed
||– Indicate type of filter
– Include filter parameters: frequency cutoff , roll-off rate/slope, etc.
|Re-referencing (for EEG)
||– Report the digital reference and how this was computed
– Justify choice of the re-reference scheme
||– Method of co-registration of measurement sensors to structural MRI scan of the participant’s head or MRI template (for EEG in particular)
– Volume conductor model (e.g., BEM/FEM) and tissue conductivity values (for EEG)
– Source model details (e.g., dipole, distributed,dipole scanning)
– Report parameters used for source estimation (i.e., regularization of the data covariance matrix; constraints used for source model)
||– Sensor or source space?
– Detail exact variables that have been analysed (which of the data was partialised, marginalised, or conditioned)
– For model based approach, indicate model parameters
– Specify metrics of coupling