Individual differences in successful self-regulation of the dopaminergic midbrain

Lydia Hellrung; Matthias Kirschner; James Sulzer; Ronald Sladky; Frank Scharnowski; Marcus Herdener; Philippe N. Tobler

doi:10.1101/863639

Abstract

The dopaminergic midbrain is associated with brain functions, such as reinforcement learning, motivation and decision-making that are often disturbed in neuropsychiatric disease. Previous research has shown that activity in the dopaminergic midbrain can be endogenously modulated via neurofeedback, suggesting potential for non-pharmacological interventions. However, the robustness of endogenous modulation, a requirement for clinical translation, is unclear. Here, we examined how self-modulation capability relates to regulation transfer. Moreover, to elucidate potential mechanisms underlying successful self-regulation, we studied individual prediction error coding, and, during an independent monetary incentive delay (MID) task, individual reward sensitivity. Fifty-nine participants underwent neurofeedback training either in a veridical or inverted feedback group. Successful self-regulation was associated with post-training activity within the cognitive control network and accompanied by decreasing prefrontal prediction error signals and increased prefrontal reward sensitivity in the MID task. The correlative link of dopaminergic self-regulation with individual differences in prefrontal prediction error and reward sensitivity suggests that reinforcement learning contributes to successful self-regulation. Our findings therefore provide new insights in the control of dopaminergic midbrain activity and pave the way to improve neurofeedback training in neuropsychiatric patients.

1 Introduction

The dopaminergic midbrain, including the ventral tegmental area (VTA) and substantia nigra (SN), plays a crucial role in reward processing, reinforcement learning^1–4, motivation^5,6, and decision-making⁷. Dysfunctions of the reward system have far-reaching consequences and are associated with the development of several severe psychiatric disease such as addiction⁸ and schizophrenia^9,10. Despite decades of extensive neuroscience and imaging studies which have contributed to an impressive body of knowledge of normal and abnormal reward system function, the neural mechanisms controlling midbrain activity are still not fully understood¹¹. One key issue that has received increasing attention is whether humans are able to cognitively control brain activity within the reward system. It has already been shown that both healthy controls^12,13, and patients with cocaine addiction¹⁴ can learn to regulate SN/VTA activity during real-time functional magnetic resonance imaging (rt-fMRI) neurofeedback training. However, the outcome of primary interest in neurofeedback training is a transfer beyond training itself, i.e., the ability to regulate activity also after training and without feedback. Such transfer is critical for clinical applications, including those involving disorders of the reward system¹⁵. While MacInnes and colleagues¹³ observed significant neural transfer effects in the form of increased neural activity and connectivity of the VTA during transfer on the group level, the other two studies revealed high between-subject variance in this self-regulation success. The purpose of this work is to determine how variance arises, how individuals with successful transfer effects differ from individuals without transfer effects, and whether activity in brain regions other than the VTA characterize individuals with successful transfer. We addressed these issues by combining data from two previous rt-fMRI neurofeedback studies^12,14 and pursuing three aims.

Our first goal was to characterize individual differences in the degree of successful transfer of SN/VTA self-regulation and thereby differentiate regulators from non-regulators. Individual differences in regulation success and high variability of transfer effects arises also in other neurofeedback modalities such as electroencephalography (EEG) and are often neglected¹⁶. For rt-fMRI neurofeedback control, neural activity in the cognitive (or executive) control network may play an important role especially when performing a demanding task such as imagery¹⁷. Therefore, and based on the known direct and indirect connections between prefrontal cortex and SN/VTA^18–21 we hypothesize that successful transfer of SN/VTA regulation is associated with activation in brain regions that are part of the cognitive (executive) control network, especially prefrontal areas.
Our second goal was to determine whether the framework of (operant) associative learning can be used to explain neurofeedback training. In applications of the associative learning framework to neurofeedback^17,22, the feedback provides a higher order reward and the chosen mental strategy is reinforced in proportion to the sign and magnitude of the feedback. At the beginning of the training, participants cannot predict which strategy will consistently lead to an up- or downregulation in brain activity within the target region. Therefore, if they use an adequate strategy, participants receive more reward than predicted corresponding to a positive prediction error. As a consequence, they would be more likely to repeat the strategy, expect higher feedback next time and gradually learn how to keep the feedback signal high. Accordingly, in regulators the size of the prediction error should gradually decrease as the expected feedback increasingly converges with the actual feedback. In contrast, for non-regulators and participants in a control group receiving unrelated or unstable feedback, the prediction errors would remain large and variable because these participants cannot learn any association between mental strategies and feedback. These straightforward implications of current theorizing about the mechanisms underlying neurofeedback remained largely untested (for a simulation study on the temporal dynamics of feedback: Oblak and colleagues²³; for the correlation of BOLD with signal increase (‘success’) and decrease (‘failure’) during regulation: Radua and colleagues²⁴. Here, we directly investigate the prediction error mechanism in regions that control the SN/VTA, which itself has been traditionally associated with the coding of reward prediction errors in both animal^2,25,26 and human research^27,28. Furthermore, the causal sufficiency of dopaminergic prediction error signals for learning has been reinforced by optogenetics^29,30. Together, we hypothesize here that decreasing prediction error signals during neurofeedback learning are associated with successful self-regulation and transfer effects.
Our third aim was to relate individual differences in the ability to regulate the midbrain to characteristics of reward processing, in order to further distinguish regulators from non-regulators. Thus, we asked whether successful neurofeedback training, as measured by transfer effects, taps into general properties of the reward system. Given that adaptive reward processing characterizes the SN/VTA^1,31 we used a variant of the monetary incentive delay (MID) task that captures differences in adaptive reward sensitivity between clinical and non-clinical populations³². Using this task, we tested the hypothesis that reward processing in regions that may control the dopaminergic midbrain is related to successful SN/VTA self-regulation.

In sum, to study individual differences in capability to gain control of the SN/VTA we used rt-fMRI neurofeedback training in healthy participants receiving either real feedback (veridical group) or inverted feedback (control group). We quantified the individual degree of successful transfer by comparing the individual post-training versus pre-training self-regulation capabilities. Moreover, we related individual differences in reward sensitivity to separately measured SN/VTA self-regulation success.

2 Methods

2.1 Participants

Fifty-nine right-handed participants (45 males, average age 28.25±5.25 years) underwent SN/VTA neurofeedback training. We analysed data from two independent projects, which used highly similar rt-fMRI paradigms, rt-fMRI software and scanner hardware. The first dataset¹² comprised male participants, randomly assigned to one of two groups. The experimental group received veridical neurofeedback (N = 15), the control group received inverted neurofeedback (N = 16) as training signal. The second dataset¹⁴ comprised the healthy control participants (N=28, 14 males) of a project investigating also cocaine users (these data are not presented here). This group received veridical neurofeedback. A subset of the participants in the second dataset (N=25) also performed a variant of the monetary incentive delay (MID) task³². All participants provided written informed-consent and received compensation for their participation. The Zurich cantonal ethics committee approved these studies in accordance with the Human Subjects Guidelines of the Declaration of Helsinki.

2.2 Experimental setup and neuroimaging

All participants underwent neuroimaging in a Philips Achieva 3T magnetic resonance (MR) scanner using an eight channel SENSE head coil (Philips, Best, The Netherlands) either at the Laboratory for Social and Neural Systems Research Zurich (SNS Lab, Study 1) or the MR Center of the Psychiatric Hospital of the University of Zurich (Study 2). First, we acquired anatomical images (Study1: gradient echo T1-weighted sequence in 301 sagittal plane slices of 250 × 250 mm² resulting in 1.1 mm³ voxels; Study2: spin-echo T2-weighted sequence with 70 sagittal plane slices of 230 × 184 mm² resulting in 0.57 × 0.72 × 2 mm³ voxel size) prior to neurofeedback training and loaded them into BrainVoyager QX v2.3 (Brain Innovation, Maastricht, The Netherlands) to identify SN/VTA as target region (see 2.4 for details). To acquire functional data, we used 27 ascending transversal slices in a gradient echo T2*-weighted whole brain echo-planar image sequence in both studies. The in-plane resolution was 2 × 2 mm², 3 mm slice thickness and 1.1 mm gap width over a field of view of 220 × 220 mm2, a TR/TE of 2000/35 ms and a flip angle of 82°. Slices were aligned with the anterior–posterior commissure and then tilted by 15°. Functional images were converted from Philips par/rec data format to ANALYZE and exported in real-time to the external analysis computer via the DRIN software library provided by Philips. This external computer ran Turbo BrainVoyager v3.0 (TBV – Brain Innovation, Maastricht, The Netherlands) to extract the BOLD signal from the images and calculate the neural activation for the feedback signal. The visual feedback signal was presented using custom-made software with Visual Studio 2008 (Microsoft, Redmond, WA, USA) through either a mirror mounted at the rear end of the scanner bore (Study 1) or through MR compatible goggles (Study 2).

2.3 Neurofeedback paradigm

The participants were instructed that their goal was to control a reward-related region-of-interest in their brains by imagining rewarding stimuli, actions, or events. We have previously shown that reward imagination activates SN/VTA with conventional fMRI³³. Prior to scanning, we provided examples of such rewards, including palatable food items, motivating achievements, positive experiences with friends and family, favourite leisure activity or romantic imagery. We encouraged participants to use these different rewards as potential strategies for upregulating reward-related activity during the cue ‘Happy Time!’, here referred to as IMAGINE_REWARD condition. In contrast, during the cue ‘Rest’ (here referred to as REST condition), participants were asked to perform neutral imagery, such as mental calculation to reduce reward-related activity. In both conditions, real-time SN/VTA BOLD signal was continuously fed back to the participant visually with a smiley vertically translating proportional to the signal (Figure 1). Prior to training, participants were familiarized with the 5s delay of the hemodynamic response affecting the display of the feedback and were asked not to move or change their breathing during the neurofeedback training.

Figure 1 Neurofeedback paradigm:

(A) All runs consisted of alternating blocks of REST and IMAGINE_REWARD conditions, with each block lasting 20 s. The regulation conditions (REST, IMAGINE_REWARD) were indicated by words (‘Rest’ or ‘Happy Time!’) and the feedback presented as moving smiley face during neurofeedback training runs. The baseline and transfer runs comprised no feedback. The SN/VTA signal difference from these runs served to quantify the degree of regulation transfer (DRT) as (SN/VTA_BOLD_{{IMAGINE_REWARD, Transfer}} − SN/VTA_BOLD_{{REST, Transfer}}) − (SN/VTA_BOLD_{{IMAGINE_REWARD, Baseline}} − SN/VTA_BOLD_{{REST, Baseline}}). (B) Post-processed SN/VTA signal was extracted from the probabilistic atlas mask³⁴.

Each neurofeedback session comprised: a pre-training imagery baseline run without any feedback, three (Study 1) or two (Study 2) training runs during which neurofeedback was presented (as Study 2 also investigated patients, training was limited to two runs), and a transfer run (i.e., without feedback). Each of these runs comprised nine blocks of IMAGINE_REWARD and REST conditions, each lasting 20 s. To determine the current level of the feedback signal we used the average of the last five volumes of the previous REST condition as reference value and employed a moving average of the previous three volumes to reduce noise. In the veridical feedback group, the smiley moved up with increasing percent signal change of SN/VTA BOLD signal and changed colour from red to yellow (Fig. 1 A). In the inverted feedback group, the smiley moved up and turned yellow with a decreasing SN/VTA BOLD signal.

2.4 Region-of-interest SN/VTA

In both studies, the target region for neurofeedback, i.e. the substantia nigra (SN) and ventral tegmental area (VTA), was structurally identified using individual anatomical scans. Since the individual mask definition slightly differed between Study 1 and 2 (T1-weighted scans in Study 1 and T2-weighted scans in Study 2), we used an independent mask for our post-hoc analysis. By this, we can control for individual differences between experimenter ROI selection strategies, to avoid interpolation confounds due to warping by normalization and use a reliable seed region for functional connectivity analysis. Specifically, we used the probabilistic mask of the SN and VTA as defined by³⁵, which is based on a large sample (148 datasets) and available on https://www.adcocklab.org/neuroimaging-tools (download August 2018). Figure 1B illustrates this mask within the brain. From this mask image, we extracted and averaged SN/VTA activity for each participant using custom-made scripts in Matlab R2016b.

2.5 Degree of regulation transfer (DRT)

We assessed the effects of individual differences in performance to characterise participants on a continuous regulation scale. The measure of successful self-regulation was defined as individual degree of regulation transfer (DRT), i.e. as the condition-specific SN/VTA signal difference between post-training (Transfer) and pre-training (Baseline) runs:

Thus, a positive DRT corresponds to a relative increase in post-training SN/VTA BOLD activity compared to pre-training SN/VTA BOLD activity for the contrast IMAGINE_REWARD minus REST. It is essential to note that during these two runs (pre-training baseline, post-training transfer) no neurofeedback was presented. This definition ensures comparability between participants in the different intervention groups, while the perception and processing of the feedback signal during the training runs might be different and influencing the SN/VTA signal itself.

To achieve positive transfer effects, participants had to apply what they had learned during training runs. We therefore asked whether DRT is related to SN/VTA activity during the training runs by calculating the correlation between them. For this, we used the slope of SN/VTA signal change increase over training time in Spearman’s correlations for the intervention and control group.

DRT distributions

To investigate potential group differences in DRT, we transferred the extracted data to R (R-project R3.4.1). Using an ANOVA and a non-parametric Kruskal-Wallis test, we tested for differences of the mean between the three groups (i.e. the two groups receiving veridical feedback in Studies 1 and 2 and the control group receiving inverted feedback in Study 1).

DRT in fMRI analysis

The DRT measure served to investigate the individual differences in successful transfer at the whole brain level. In particular, we were interested to identify regions that were positively associated with DRT and thus potentially contribute to regulation of the SN/VTA. For this analysis, we entered mean-centered individual DRT levels in all fMRI second level statistical models (see 2.8). We excluded SN/VTA from all analyses to avoid any circularity.

Spatial specificity control analysis

To investigate the spatial specificity of our analysis of dopaminergic midbrain regulation, we performed the same whole brain analysis as described above for SN/VTA with a different ROI. Specifically, we used the neighboring brain region of the parahippocampus (Supplemental Material). In keeping with specificity, this control analysis revealed little commonality (limited to the cerebellum and temporal gyrus) with the SN/VTA analysis (Figure S4 and Table S8).

2.6 MID Task

In addition to the neurofeedback training, the participants in Study 2 (N=25) performed a MID task that captures differences in adaptive reward sensitivity. In every trial of the MID task^32,36,37 first one of three cues appeared (Fig. S1). One cue was associated with large reward (ranging from 0 to 2.00 CHF), one cue with small reward (0 to 0.40 CHF) and one cue with no reward. After a delay of 2.5 to 3 s, participants had to identify an outlier from three circles by pressing one of three buttons as quickly as possible. Depending on the cue, their response time and the correctness of the answer, participants gained an amount of money. Importantly, the use of large and small reward ranges enables investigation of individual differences not only in general reward sensitivity but also in how well the reward system adapts to different reward distributions, so-called adaptive reward coding³².

2.7 MR Data pre-processing

We despiked the functional data using AFNI toolbox (National Institute of Mental Health; http://afni.nimh.nih.gov/afni). To account for differences in echo-planar-image (EPI) slice acquisition times we employed temporal interpolation of the MR signal, shifting the signal of the misaligned slices to the first slice³⁸ using FSL 5 (FMRIB Software Library, Analysis Group, FMRIB, Oxford, http://fsl.fmrib.ox.ac.uk). Furthermore, data were bias-field corrected using ANTs (Advanced Normalization Tools; http://stnava.github.io/ANTs), realigned using FSL 5, normalized to standard Montreal Imaging Institute (MNI) space using ANTs in combination with a custom scanner-specific EPI-template resulting in a 1.5 mm³ isotropic resolution and finally smoothed with a 6 mm full-width-half-maximum Gaussian kernel using FSL 5.

The spatial specificity control analyses (Figure S4 and Table S8) suggest that the findings reported here are not due to common physiological noise. To more directly account for noise, we additionally acquired physiological data in a subsample of participants. In the available subsample, neither changes in heart rate variability nor respiration were significantly correlated with VTA/SN activation during reward imagination (see details in Kirschner et al.³⁹, Supplemental Material Table S1, Figure S1). Here, we also used an image-based correction to account for physiological artefacts in all participants. Since physiological artefacts are most prominently present in CSF and white matter due to the absence of BOLD effects, pulsations of the ventricles, and proximity to the large brain arteries (e.g., circle of Willis), we decided to use an established preprocessing procedure based on a principal component analysis (PCA) approach^40,41. Specifically, we calculated the global mean and the first 6 components of a temporal principal component analysis on the cerebrospinal fluid and white matter signal. These 6 components were used as noise regressors in the first level statistics (see 2.8) in addition to the 6 motion parameters. Along with the pre-processing of the fMRI data, the SN/VTA mask used as ROI for the analysis was resliced into the dimensions of the functional data using SPM 12 (v6906, Wellcome Trust Centre for Neuroimaging, UCL, London, UK; http://www.fil.ion.ucl.ac.uk/spm/software/spm12/) within Matlab R2016b (Mathworks, Sherborn, MA, USA).

2.8 MR Data analysis

For all of the following analyses, we used the toolbox SPM 12 (v6906) within Matlab R2016b. All figures were created using bspmview v.20161108⁴² and ggplot2 within R 3.4.1. All group-level analysis included an additional covariate for the dataset to account for potential global signal differences between studies.

Post-training effects: Correlation with DRT in veridical and inverted feedback group (aim 1)

The first question of this study asked whether the individual degree of successful neurofeedback transfer is associated with individual differences in the cognitive control network. To answer this question, we conducted a general linear model (GLM) on the single subject level including one block-wise regressor for the IMAGINE_REWARD condition and one for the REST condition with 190 timesteps (each condition comprised 9 onsets and lasted 20 s) for each of the four runs separately. Additionally, we modelled the first 5 TRs of every run as nuisance regressor and added also motion and physiological artefact regressors (see section 2.7) in the design matrix. In total the GLM consisted of fifteen regressors. We formed the contrast IMAGINE_REWARD-REST and compared it between Transfer and Baseline runs, i.e. (IMAGINE_REWARD-REST)_Transfer − (IMAGINE_REWARD-REST)_Baseline. At the group level, we tested for correlation of the SN/VTA-derived DRT with this contrast in a one-sample t-test. We ran these analyses in all voxels other than the SN/VTA and separately for both the veridical and inverted feedback groups. To test for common and separate activity between the groups, we performed conjunction and disjunction analyses over the two group maps. Additionally, we performed a two-sample t-test group comparison analysis to identify significant group differences. To identify activity within the cognitive control network, we used a cognitive control template based on the coordinates from a meta-analysis⁴³. We created this template with fslmaths and spheres of 15 mm around all coordinates from the meta-analysis. In table S1 we identify regions of the cognitive control network where transfer success correlates with DRT within the template. For statistical maps, we used FWE-corrected cluster level threshold with p < 0.05 (cluster extent of 230 voxel) based on whole brain statistics p < 0.001. In addition, to test the functional specificity of our results, we performed a meta-analytic functional decoding analysis using the Neurosynth database (www.neurosynth.org). This relates the neural signatures of the cognitive control decoding network to other task-related neural patterns (Fig. S2).

Prediction error coding analysis during NF training (aim 2)

The second question of the study asked whether successful neurofeedback performance was associated with a reduction in prediction error during the training runs as captured by a classic reinforcement learning framework. To address this issue, we determined the temporal difference of the feedback signal (i.e., the change in height of the smiley) as proxy for the prediction error signal. Specifically, for the neurofeedback training runs we constructed a GLM that replaced the block-level (IMAGINE_REWARD and REST) regressors with corresponding event-level regressors that modelled every TR and that we parametrically modulated with a time-resolved continuous prediction error (PE) term. This PE term was defined as difference between the current and the previous TR within the SN/VTA mask, i.e. (BOLD_SN/VTA_t- BOLD_SN/VTA_t−1; accordingly, in the upregulation condition the parametric modulator corresponded to IMAGINE_REWARD_t ‒IMAGINE_REWARD_t−1). To investigate if the prediction error decreases over time, we used the difference (parametric modulator PE (run 2) – parametric modulator PE (run 1), i.e. PE coding in neurofeedback training run 2 minus neurofeedback training run 1 (Figure 1A). This difference should become negative as prediction errors decrease with learning. On the group level, we correlated this contrast (difference in PE coding run2 – PE coding run1) with the DRT measure in a one-sample t-test to test for associations between a decrease in prediction error coding and successful self-regulation.

The results of this analysis, showing prediction error coding in the dorsolateral prefrontal cortex (dlPFC), inspired a functional connectivity analysis. Specifically, we investigated the functional impact of the dlPFC prediction error signal on the SN/VTA using a psychophysiological interaction analysis using the gPPI v13 Toolbox⁴⁴ based on the MNI coordinate of dlPFC (x=40, y=10, z=38) with a 5 mm sphere as seed region. We added activity from this seed region as physiological regressor to the original GLM and interacted it with both the IMAGINE_REWARD and REST regressors to form interaction regressors. Functional connectivity was calculated by contrasting the interaction terms IMAGINE_REWARD-REST between second and first neurofeedback training run. We then correlated this contrast with DRT. The results were focused to the SN/VTA region as target. For statistical maps, we used a whole-brain threshold of p < 0.001 (20 voxel extent).

Relation between DRT and reward sensitivity in the MID Task (aim 3)

To address the third aim of the study, we investigated the relationship between reward processing in the MID task and the capacity to successfully regulate the SN/VTA in the neurofeedback experiment. In particular, we considered two contrasts in the MID task (1) general reward sensitivity, defined as the sum of parametric modulators: small plus large reward (2) adaptive reward coding, defined as the difference between parametric modulators: small minus large reward. Again, we used correlation analysis at the group level to determine whether these two contrasts are related with individual SN/VTA transfer success (DRT) in the neurofeedback task. Moreover, to assess the commonalities of the neural activities in these different tasks, we performed a conjunction analysis of contrasts (1), (2) and the correlation of transfer-activity with DRT (see 2.8). For statistical maps, we used a whole-brain threshold of p < 0.001 (20 voxel extent due to conjunction).

2.9 Additional behavioral measurements

Strategies

All participants were introduced to five example strategies (see 2.3) that they could use to upregulate brain activity but also free to use their own strategies. At the end of the experiment, participants filled in a custom-made questionnaire on the strategies they used. To compare strategies between the groups, we used a χ2-test to assess differences in the distribution of strategy usage. We did not observe any significant group differences in strategy use (p = .9), and therefore did not consider this measurement in any further analysis.

Personality measures

To investigate whether individual differences in behavior and personality were associated with individual differences in DRT, Study 2 measured: (1) Smoking status in number of cigarettes per day; (2) verbal IQ as determined by the Multiple Word Test (MWT⁴⁵); (3) Positive and Negative Affect Score (PANAS) in the German version⁴⁶; (4) attentional and nonplanning subscores of the Barratt Impulsivity Scale in the German version⁴⁷. We tested for correlations with the DRT parameter using Pearson correlations. As none of these variables correlated significantly with the DRT parameter (all p > 0.5), we did not consider them further.

3 Results

3.1 No difference in degree of regulation transfer (DRT) across groups

We first evaluated the DRT measure and compared it between the three datasets. There were no significant differences across all three groups (mean DRT veridical group Study 1 = 0.01, mean DRT veridical group Study 2 = −0.02, mean DRT inverted group Study 1= −0.05; anova testing F(2, 56) = 0.13; non-parametric Kruskal-Wallis testing H(2) = 0.39, p = 0.82; Fig. 1; see Supplemental Figure S5 for alternative illustration). Moreover, also the direct comparison between the two veridical groups was not significant (T(39) = −0.26, p = 0.8). Accordingly, we combined the two veridical groups for subsequent analyses. Importantly, our participants showed considerable variation in DRT, which allowed us to investigate the individual differences in brain activity accompanying more or less successful regulation of the SN/VTA through neurofeedback. Thus, the groups showed similar mean levels and considerable individual differences in self-regulation success.

3.2 Correlation of slopes between transfer and training only for intervention group

Next, we tested for differences between groups in the relationship of SN/VTA transfer as measured by DRT with signal change in SN/VTA during training. We found positive correlations between the slope of SN/VTA signal change during training period and DRT only for the veridical feedback group, but not for the control group (veridical group ρ = 0.62, p < 0.001; inverted group Rho = −0.3, p = 0.25). Although the comparability between training and transfer is limited due to the feedback signal processing, this indicates that only the veridical feedback group benefitted from the feedback. More importantly, within the veridical feedback group, particularly those individuals who were more successful at transfer also showed stronger upregulation during training.

3.3 Individual variation in transfer: DRT associated with cognitive control network in veridical and amygdala activity in inverted feedback group

3.3.1 Veridical feedback group

We investigated whether individual levels of successful SN/VTA self-regulation (DRT) were associated with increased post-training activity compared to pre-training activity in other regions of the brain (IMAGINE_REWARD-REST)_Transfer − (IMAGINE_REWARD-REST)_Baseline). This analysis revealed several areas consistently reported by neurofeedback studies (see Fig. 2 in the meta-analysis of Sitaram et al.¹⁷, including dorsolateral prefrontal cortex (dlPFC), anterior cingulate cortex (ACC), lateral occipital cortex (LOC), and thalamus (Figure 3A and Table 1). To formally test for a more general association with the cognitive control network, we applied a cognitive control network template from a meta-analysis⁴³, which in addition revealed neural activity in precuneus and striatum (Fig. 3B for exemplary illustrations of dlPFC, ACC, temporal gyrus, and thalamus activity; Table S1 for full overview). Thus, regions of the cognitive control network showed transfer to the extent that neurofeedback training of the dopaminergic midbrain was successful.

View this table:

Table 1:

Correlation of transfer activity (IMAGINE_REWARD_transfer − REST_transfer) − (IMAGINE_REWARD_baseline − REST_baseline) with DRT in veridical feedback group (see Figure 3a). Table shows all local maxima separated by more than 20 mm; for all clusters, p < 0.05 FWE-corrected on cluster level; df = 40. Regions were labelled using the Harvard-Oxford atlas and/or the Anatomy Toolbox in parentheses; the activity in SN/VTA has been excluded from the table to avoid circularity; x,y,z = Montreal Neurological Institute (MNI) coordinates in the left-right, anterior-posterior, and inferior-superior dimensions, respectively.

Figure 2 Distribution of DRT across groups.

The DRT measure was distributed similarly in both groups receiving veridical feedback in Studies 1 and 2 and the control group receiving inverted feedback in Study Accordingly, we found no evidence supporting a main effect of feedback on transfer. However, DRT varied substantially across individuals, which motivated the analyses using the individual self-regulation success.

Figure 3: Correlation of DRT with transfer success after training in veridical feedback group:

To investigate whole-brain neural activity correlating with successful SN/VTA self-regulation, we used DRT as measure of successful regulation of the SN/VTA and correlated it with the contrast (IMAGINE_REWARD_transfer − REST_transfer) − (IMAGINE_REWARD_baseline − REST_baseline) as measure of learning related change in neural activity in the rest of the brain. A) The analysis revealed task-specific correlations primarily within the cognitive control network (whole brain overview FWE-corrected with p < 0.05 on cluster level, projected to lateral and medial sagittal sections). B) Exemplary correlations within the cognitive control network have been depicted, here in MFG/dlPFC, ACC, Thalamus, and bilateral Temporal Gyrus, to illustrate the association between neural activity with DRT. The correlations are for illustration purposes only without further significance testing to avoid double dipping. The grey shaded area identifies 95 % confidence interval.

3.3.2 Inverted feedback group

For the inverted feedback group, the same analysis resulted in partly distinct activations. In contrast to the veridical feedback group, left amygdala activity correlated significantly with DRT (Fig. 4 and Table S2). Importantly, activity in cognitive control areas reported above, such as dlPFC and ACC, was significantly weaker in inverted than veridical feedback groups (Table S3 for disjunction and direct statistical comparison). Together with the lack of correlation of DRT with SN/VTA signal change during training for the inverted feedback group, these findings suggest that cognitive control regions play a preferential role for successful transfer of SN/VTA self-regulation.

Figure 4 Correlation of DRT with transfer success after training in inverted feedback group:

(A) Receiving inverted feedback resulted in a correlation between DRT as measure of regulation success and the contrast (IMAGINE_REWARD_transfer − REST_transfer) − (IMAGINE_REWARD_baseline − REST_baseline) as measure of learning related change in the amygdala (p < 0.001). This region was not observed in the veridical group. (B) The correlation depicts the positive association of neural activity in the amygdala with DRT. The plot is for illustration purposes only without further significance testing to avoid double dipping. The grey shaded area identifies the 95 % confidence interval.

We also tested for common activity in the two feedback groups using conjunction analysis. Similar to the veridical group, the inverted feedback group showed correlations between DRT and activity in the precuneus, middle temporal gyrus, insula, thalamus, and parahippocampal gyrus (Table S4). These common areas appear to reflect non-specific regulation activity and may be associated with memory and introspection processes.

3.4 Reinforcement learning: DLPFC prediction error coding during neurofeedback training correlates with DRT

To investigate whether reinforcement learning mechanisms contribute to successful neurofeedback transfer, we tested for the temporal differences in the feedback signal as proxy for the prediction error signal during the training runs. We reasoned that prediction error activity should decrease from early to late phases of neurofeedback training for successful regulators. At any time during neurofeedback training, participants needed to come up with their own predictions of the upcoming feedback signal and compare the predictions with actual feedback at the next time point. Similarly, in temporal difference learning models, prediction errors are calculated at each moment in time⁴⁸. Therefore, we operationalized prediction error by subtracting the immediately preceding SN/VTA activity (prediction) from the present SN/VTA activity (outcome). Specifically, we tested for a negative correlation of DRT with the difference in prediction error coding signals between late and early training. In other words, only for participants with high DRT we expected to observe a decrease of prediction error signal over the course of the neurofeedback training. We found such gradually decreasing prediction error signals in dlPFC (Fig. 5 and Table S5). To interrogate the finding in detail, we also analysed the two neurofeedback training runs separately. This analysis confirmed that only successful regulators showed less pronounced dlPFC coding of prediction error in late compared to early training (see Fig. S3 for run-wise PE coding in dlPFC). Importantly, it should be noted that this decrease of error signals in dlPFC is related to the individual DRT levels. The basic contrast of prediction error coding, i.e. without correlation to DRT, revealed striatal activity.

Figure 5: Prediction error coding in dlPFC decreases during NF training in participants with successful SN/VTA self-regulation:

(A) The neural prediction error signal, corresponding to the temporal difference between the current and immediately preceding feedback activity from the SN/VTA decreased with ongoing feedback training (i.e, the difference between the last and first run) within dlPFC more strongly in individuals with higher DRT (p < 0.001). This finding is consistent with reinforcement learning theories, according to which prediction errors decrease as learning progresses. By extension, a reinforcement learning framework can explain successful neurofeedback training. (B) The plot depicts the differences in prediction error signals in dlPFC between the last and first training for every participant. This shows that the individual degree of regulation success statistically relates to the decrease in prediction error coding over training. The plot is for illustration purposes only without further significance testing to avoid double dipping. The grey shaded area identifies the 95 % confidence interval.

3.5 Learning-related functional coupling of DLPFC with SN/VTA

Following on from our finding of decreasing prediction error coding in dlPFC being related to individual success of regulating the dopaminergic midbrain, we performed a functional connectivity analysis to investigate whether the identified dlPFC region communicates with the SN/VTA region our participants aimed to regulate. Thus, we used the dlPFC region showing decreasing prediction error coding during training particularly in successful regulators as a seed region and investigated the coupling to the SN/VTA. Functional connectivity between the two regions increased with transfer success (Fig. 6; (t(40) = 3.79, cluster extent = 16, MNI × = −2, y = −16, z = −15). In other words, DRT and dlPFC to SN/VTA connectivity correlated positively. Note that this correlation of DRT with dlPFC-SN/VTA connectivity was task-related as it was enhanced during IMAGINE_REWARD relative to REST (which served as psychological regressor) and independent of SN/VTA activity.

Figure 6 Functional connectivity between dlPFC and SN/VTA correlates with transfer success:

(A) A functional connectivity analysis based on the prediction error coding seed region in the dlPFC (MNI coordinate 40, 10, 38, 5 cm sphere) revealed that connectivity with the SN/VTA correlated positively with success of neurofeedback training (p < 0.001). (B) DRT increased with increasing connectivity between dlPFC and SN/VTA during IMAGINE_REWARD vs. REST in neurofeedback training runs. Thus, dlPFC appears to communicate with SN/VTA in proportion to the degree to which neurofeedback training is successful. (C) The correlation plot depicts connectivity between dlPFC and SN/VTA with DRT. The plot is for illustration purposes only without further significance testing to avoid double dipping. The grey shaded area identifies the 95 % confidence interval.

3.6 Individual differences in dlPFC reward sensitivity during MID task correlate with regulation success

In Study 2 we used the MID task to independently measure reward sensitivity and the capability to adapt to different reward contexts³². We asked whether individual measures of reward processing (measured with parametric and adaptive coding of reward related BOLD activity) are related to individual success in regulating the SN/VTA. Specifically, we tested for correlations between DRT and (i) MID reward sensitivity (sum of small and large reward parametric modulators) and (ii) MID adaptive reward coding (difference of small minus large reward parametric modulators). These two correlations both identified dlPFC (Fig. 7A). Moreover, a conjunction of these two correlations with the correlation between DRT and the contrast (IMAGINE_REWARD_transfer − REST_transfer) − (IMAGINE_REWARD_baseline − REST_baseline) outside SN/VTA revealed common neural activity in the dlPFC (center at MNI x = 40, y = 10, z = 38; Fig. 7 and Table S6). Thus, the more successful individuals were at self-regulating SN/VTA as a result of neurofeedback training, the more sensitive they were to reward and the more strongly they adapted to different reward contexts in the MID task.

Figure 7 Reward-sensitivity in dlPFC correlates with successful SN/VTA self-regulation:

(A) Degree of successful SN/VTA transfer (DRT) in the neurofeedback task correlated with prefrontal reward sensitivity and adaptive coding in the MID task. A conjunction analysis around the peak coordinate in dlPFC showing DRT-related decreases in prediction error coding during neurofeedback training (MNI x = 40, y = 10, z = 38, left) revealed common neural activity reflecting transfer (IMAGINE_REWARD_transfer − REST_transfer) − (IMAGINE_REWARD_baseline − REST_baseline) and reward sensitivity (small + large reward magnitude parametric modulators in MID, all contrasts with p<0.001). Moreover, individuals with more successful self-regulation of the SN/VTA showed stronger adaptive reward coding (which reflects higher sensitivity to small relative to large rewards) in the same region that also showed DRT-related decreases in prediction error coding during neurofeedback training (right). (B) The correlation plot depicts adaptive reward coding activity in dlPFC with DRT. The plot is for illustration purposes only without further significance testing to avoid double dipping. The grey shaded area identifies the 95 % confidence interval.

4 Discussion

In the present work, we used data acquired from two previous rt-fMRI neurofeedback studies to characterize individual differences and processes underlying successful transfer of self-regulation of the dopaminergic midbrain after neurofeedback training. This novel perspective on self-regulation success revealed insights on what distinguishes individuals who were more successful at SN/VTA regulation from those who were less successful. We found a significant relation between self-regulation success and increases in post-training activity in the cognitive control network. Moreover, we found four correlations with increasing transfer effects: (i) decreasing dlPFC prediction error signals during neurofeedback training, (ii) increasing connectivity of dlPFC with the SN/VTA for reward imagination compared to rest during transfer, (iii) increasing reward sensitivity in dlPFC and (iv) increasing adaptive reward coding in dlPFC in the independent MID task. Together, our study first shows that neurofeedback control of the dopaminergic midbrain relies on the cognitive control network. Second, our study suggests that the predictability of the upcoming feedback as reinforcement learning signal contributes to successful neurofeedback training.

Sustained self-regulation skills and the generalization of learning after neurofeedback training are key elements for practical applications and remain one of the major challenges in rt-fMRI neurofeedback research⁴⁹. Results from previous neurofeedback studies of the reward system have been inconclusive^12,50,51 and only one study¹³ reported significant post-training activity in the VTA, and increased mesolimbic network connectivity. Methodological limitations might have hampered the ability to detect transfer effects. First, previous studies focused exclusively on self-regulation of one a priori target region, such as SN/VTA, instead of investigating large-scale post-training effects within the whole brain. Second, transfer effects were examined at the group-level, which did not reflect the individual learning success. In the present study we overcome both limitations by taking advantage of an individual measure of transfer success (DRT) and focusing on the whole brain.

One insight of the present study is that transfer success associates with neural activity in cognitive control network areas^43,52, such as dlPFC and ACC. The lack of cognitive control engagement within the control group and the correlation of DRT with the slope of SN/VTA increase during training in the intervention group only underpins that this finding is specific for the successful transfer of the learned self-regulation procedure. This network overlaps with regions that have been associated with feedback-related information processing during training^53,54. Together, these findings suggest that the same regions contribute to acquisition and transfer of neurofeedback and that sustained post-training self-regulation generalizes across a functional network of different brain regions. Intriguingly, similar networks have been reported in skill learning. Future studies might investigate commonalities between neurofeedback and particularly cognitive skill learning, taking into account the specific temporal dynamics of both functions^22,55.

The finding that individuals with more successful regulation of the dopaminergic midbrain show stronger activation of cognitive control areas during transfer speaks to our understanding of how individual differences in cognitive control affect emotion regulation^56–59. For example the working memory component of cognitive control has been shown to predict negative affect reduction through reappraisal and suppression⁶⁰. Interestingly, dopamine action (particularly at D1 receptors) in dlPFC sustains working memory performance⁶¹. Thus, it is conceivable that frontolimbic loops contribute to successful transfer. In any case, this notion converges with our finding of dlPFC-SN/VTA coupling being related to regulation success.

Future research might explore whether our findings, the positive post-training effects on the cognitive-control network activity, also have implications for transdiagnostic clinical applications. First, combining rt-fMRI neurofeedback training with different forms of psychotherapy such as cognitive behavioral therapy⁶², dialectical behavioral therapy⁶³, or psychodynamic therapy^64–66 could improve emotion regulation deficits prevalent in several psychiatric disorders including substance use disorders, depression, anxiety and personality disorders. It has already been shown that in patients suffering from depression, neurofeedback training can be a successful tool to re-stabilize modulation of the amygdala and increase its responsivity to reward⁶⁷. It remains a question for future patient studies if a re-stabilization of cognitive control is also possible via such a training. With particular attention to substance use disorders, maladaptive changes in neuroplasticity within the cognitive control network are closely associated with loss of control and compulsive drug-seeking^68–70. In these patients, neurofeedback training might be able to directly target the biological correlates and reinstate function of the cognitive-control network.

We found a reduction in prediction error coding in the DLPFC over the course of the neurofeedback training for successful regulators only, while these prediction error signals remained high for non-regulators. This finding suggests that prediction error-driven reinforcement learning was more pronounced in regulators than non-regulators and provides empirical evidence for previous theoretical proposals on the principles of neurofeedback learning independent of feedback modality²². Thus, reinforcement learning provides a framework for understanding how neurofeedback works. Future research may want to investigate whether the rich theoretical and empirical tradition of reinforcement learning⁷¹ can be harnessed to facilitate neurofeedback training.

We found that successful SN/VTA self-regulation is associated with an increased functional coupling between dlPFC regions coding prediction error and the dopaminergic midbrain. This coupling fits well with anatomical connections between dlPFC and the dopaminergic midbrain^19,21 as well as effective connectivity studies on motivation⁷² and animal studies on prefrontal regulation of midbrain activity^20,73. The animal work suggests that prefrontal cortex communicates with dopaminergic neurons primarily indirectly, through inhibitory relay neurons. By relating this coupling to successful midbrain self-regulation, our data go beyond previous connectivity studies of the dopamine system, which primarily focused on coupling between the prefrontal cortex and the striatum^74–76.

At the functional level, a recent study on creative problem solving in humans highlights that dlPFC is involved in experiencing a moment of insight⁷⁷. According to this effective connectivity study, dlPFC could upregulate the VTA/SN via striatal connections during such a moment. On the other hand, in trials where no solution was found for a given problem, also no significant connectivity was observed. This study supports our finding that dlPFC-SN/VTA connectivity plays an important role in self-guided motivation and in internal reward processing. Our finding points to the possibility that cognitive and affective mechanisms associated with different experiences also involve different neural pathways. Future studies should investigate to what degree individual differences in the functional architecture of brain networks⁷⁸ influence these internal reward mechanisms and to which degree different strategies can influence neurofeedback training success.

Our independent reward task revealed that individual differences in prefrontal reward sensitivity and efficient adaptive reward coding were associated with successful SN/VTA self-regulation. Adaptive coding of rewards captures the notion that neural activity (output) should match the most likely inputs to maximize efficiency and representational precision⁷⁹. Accordingly, we previously showed that reward regions encode a small range of rewards more sensitively than the large range of rewards^37,80. Interestingly, in the present study, participants who were more sensitive to small rewards were also more successful in self-regulation of the dopaminergic midbrain. When participants in a typical neurofeedback training paradigm succeed at increasing the activity of the self-regulated area, the ensuing change in visual stimulation (positive neurofeedback) may constitute a small reward. By extension, adaptive reward coding may therefore provide a useful handle on identifying regulators. Moreover, future neurofeedback experiments should consider scaling the feedback signal to avoid sensitivity limitations, particularly in individuals with reduced adaptive coding.

A potential limitation of our study is that we used a combined mask for SN and VTA even though differences in functionality and anatomy have been reported for the two regions (reviewed e.g. by Trutti et al.⁸¹), with the SN more related to motor functions and the VTA to reward functions. However, it should be kept in mind that when viewed through the lens of recording and imaging rather than lesion techniques the differences are more gradual than categorical⁸². Still, future studies may want to use more specific feedback from one or the other region to more specifically target potential differences in functions. Further limitations are that only inverted feedback is available here as control group and this group has a smaller sample size. An additional control group perceiving no feedback could help to judge effects of neurofeedback training more precisely. Still, our data show a significant correlation between degree of regulation transfer and training runs only for the veridical feedback group and not for the control group. Moreover, it has been shown in other neurofeedback studies that volitional self-regulation of brain activity can only be learned when real feedback is presented⁸³ and that other control groups failed to acquire VTA self-regulation¹³.

5 Conclusions

We showed that successful transfer in SN/VTA self-regulation after neurofeedback training is associated with activity in the cognitive control network (particularly dlPFC). Future studies could employ cognitive control activity during neurofeedback training to boost success rates and clinical outcomes. Furthermore, our findings of decreasing prediction error signals in dlPFC suggest that associative learning contributes to real-time fMRI neurofeedback effects. Finally, we show that higher individual reward sensitivity at the neural level increases the chance of neurofeedback training success. Patients with reduced neural reward sensitivity may therefore benefit from careful scaling of the neurofeedback information.

Acknowledgments

The authors would like to thank Silvia Maier and Stephan Nebe for comments on previous versions of the manuscript and fruitful discussions. This project was supported by the European Union’s Horizon 2020 research and innovation program under the Grant Agreement No 794395 (to LH) and grant 100014_165884 from the Swiss National Science Foundation (to PNT). MK received grant support from the National Bank Fellowship (McGill) and Swiss National Science Foundation (P2SKP3_178175). The authors declare no competing financial interests.

Footnotes

- Description of the usage of individual success measure has been updated to clarify that all analysis here are investigating correlations of successful self-regulation - The introduction updated to clarify prediction error coding analysis

References

1.↵
Schultz, W. Predictive Reward Signal of Dopamine Neurons. J. Neurophysiol. 80, 1–27 (1998).
OpenUrl CrossRef PubMed Web of Science
2.↵
Schultz, W. Dopamine reward prediction error coding. Dialogues Clin. Neurosci. 18, 23–32 (2016).
OpenUrl CrossRef PubMed
3.
Burke, C. J. & Tobler, P. N. Time, Not Size, Matters for Striatal Reward Predictions to Dopamine. Neuron 91, 8–11 (2016).
OpenUrl
4.↵
Tobler, P. N., Fletcher, P. C., Bullmore, E. T. & Schultz, W. Learning-Related Human Brain Activations Reflecting Individual Finances. Neuron 54, 167–175 (2007).
OpenUrl CrossRef PubMed Web of Science
5.↵
Bromberg-Martin, E. S., Matsumoto, M. & Hikosaka, O. Dopamine in Motivational Control: Rewarding, Aversive, and Alerting. Neuron 68, 815–834 (2010).
OpenUrl CrossRef PubMed Web of Science
6.↵
Wise, R. A. Dopamine, learning and motivation. Nature Reviews Neuroscience 5, 483–494 (2004).
OpenUrl CrossRef PubMed Web of Science
7.↵
Friston, K. et al. The anatomy of choice: Dopamine and decision-making. Philos. Trans. R. Soc. B Biol. Sci. 369, (2014).
8.↵
Huys, Q. J. M., Tobler, P. N., Hasler, G. & Flagel, S. B. The role of learning-related dopamine signals in addiction vulnerability. in Progress in Brain Research 211, 31–77 (2014).
OpenUrl CrossRef PubMed
9.↵
Deserno, L., Schlagenhauf, F. & Heinz, A. Striatal dopamine, reward, and decision making in schizophrenia. Dialogues Clin. Neurosci. 18, 77–89 (2016).
OpenUrl
10.↵
Maia, T. V. & Frank, M. J. An Integrative Perspective on the Role of Dopamine in Schizophrenia. Biol. Psychiatry 81, 52–66 (2017).
OpenUrl
11.↵
Meder, D., Herz, D. M., Rowe, J. B., Lehéricy, S. & Siebner, H. R. The role of dopamine in the brain - lessons learned from Parkinson’s disease. Neuroimage 190, 79–93 (2019).
OpenUrl
12.↵
Sulzer, J. et al. Neurofeedback-mediated self-regulation of the dopaminergic midbrain. Neuroimage 75, 176–184 (2013).
OpenUrl
13.↵
MacInnes, J. J., Dickerson, K. C., Chen, N. kuei & Adcock, R. A. Cognitive Neurostimulation: Learning to Volitionally Sustain Ventral Tegmental Area Activation. Neuron 89, 1331–1342 (2016).
OpenUrl
14.↵
Kirschner, M. et al. Self-regulation of the Dopaminergic Reward Circuit in Cocaine Users with Mental Imagery and Neurofeedback. bioRxiv (2018).
15.↵
Klein, M. O. et al. Dopamine: Functions, Signaling, and Association with Neurological Diseases. Cell. Mol. Neurobiol. 39, 31–59 (2019).
OpenUrl CrossRef
16.↵
Alkoby, O., Abu-Rmileh, A., Shriki, O. & Todder, D. Can We Predict Who Will Respond to Neurofeedback? A Review of the Inefficacy Problem and Existing Predictors for Successful EEG Neurofeedback Learning. Neuroscience 378, 155–164 (2018).
OpenUrl
17.↵
Sitaram, R. et al. Closed-loop brain training: the science of neurofeedback. Nat. Rev. Neurosci. 18, 86–100 (2016).
OpenUrl CrossRef
18.↵
Wu, J. et al. Cortical control of VTA function and influence on nicotine reward. Biochemical Pharmacology 86, 1173–1180 (2013).
OpenUrl
19.↵
Frankle, W. G., Laruelle, M. & Haber, S. N. Prefrontal cortical projections to the midbrain in primates: Evidence for a sparse connection. Neuropsychopharmacology 31, 1627–1636 (2006).
OpenUrl CrossRef PubMed Web of Science
20.↵
Gao, M. et al. Functional Coupling between the Prefrontal Cortex and Dopamine Neurons in the Ventral Tegmental Area. J. Neurosci. 27, 5414–5421 (2007).
OpenUrl Abstract/FREE Full Text
21.↵
Sesack, S. R., Carr, D. B., Omelchenko, N. & Pinto, A. Anatomical Substrates for Glutamate-Dopamine Interactions: Evidence for Specificity of Connections and Extrasynaptic Actions. in Annals of the New York Academy of Sciences 1003, 36–52 (John Wiley & Sons, Ltd (10.1111), 2003).
OpenUrl CrossRef PubMed Web of Science
22.↵
Birbaumer, N., Ruiz, S. & Sitaram, R. Learned regulation of brain metabolism. Trends Cogn. Sci. 17, 295–302 (2013).
OpenUrl CrossRef PubMed
23.↵
Oblak, E. F., Lewis-Peacock, J. A. & Sulzer, J. S. Self-regulation strategy, feedback timing and hemodynamic properties modulate learning in a simulated fMRI neurofeedback environment. PLoS Comput. Biol. 13, e1005681 (2017).
OpenUrl
24.↵
Radua, J., Stoica, T., Scheinost, D., Pittenger, C. & Hampson, M. Neural Correlates of Success and Failure Signals During Neurofeedback Learning. Neuroscience 378, 11–21 (2018).
OpenUrl
25.↵
Ferenczi, E. A. et al. Prefrontal cortical regulation of brainwide circuit dynamics and reward-related behavior. Science (80-.). 351, aac9698–aac9698 (2016).
OpenUrl Abstract/FREE Full Text
26.↵
Tsai, H. C. et al. Phasic firing in dopaminergic neurons is sufficient for behavioral conditioning. Science (80.). 324, 1080–1084 (2009).
OpenUrl Abstract/FREE Full Text
27.↵
D’Ardenne, K., McClure, S. M., Nystrom, L. E. & Cohen, J. D. BOLD responses reflecting dopaminergic signals in the human ventral tegmental area. Science (80.). 319, 1264–1267 (2008).
OpenUrl Abstract/FREE Full Text
28.↵
Zaghloul, K. A. et al. Human substantia nigra neurons encode unexpected financial rewards. Science (80.). 323, 1496–1499 (2009).
OpenUrl Abstract/FREE Full Text
29.↵
Steinberg, E. E. et al. A causal link between prediction errors, dopamine neurons and learning. Nat. Neurosci. 16, 966–973 (2013).
OpenUrl CrossRef PubMed
30.↵
Keiflin, R., Pribut, H. J., Shah, N. B. & Janak, P. H. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr. Biol. 29, 93–103.e3 (2019).
OpenUrl CrossRef
31.↵
Tobler, P. N., Fiorillo, C. D. & Schultz, W. Adaptive coding of reward value by dopamine neurons. Science 307, 1642–5 (2005).
OpenUrl Abstract/FREE Full Text
32.↵
Kirschner, M. et al. Deficits in context-dependent adaptive coding in early psychosis and healthy individuals with schizotypal personality traits. Brain 141, 2806–2819 (2018).
OpenUrl
33.↵
Miyapuram, K. P., Tobler, P. N., Gregorios-Pippas, L. & Schultz, W. BOLD responses in reward regions to hypothetical and imaginary monetary rewards. Neuroimage 59, 1692–1699 (2012).
OpenUrl CrossRef PubMed Web of Science
34.↵
Murty, V. P. et al. Resting state networks distinguish human ventral tegmental area from substantia nigra. Neuroimage 100, 580–589 (2014).
OpenUrl CrossRef PubMed
35.↵
Murty, V. P. et al. Resting state networks distinguish human ventral tegmental area from substantia nigra. Neuroimage 100, 580–9 (2014).
OpenUrl CrossRef PubMed
36.↵
Simon, J. J. et al. Reward System Dysfunction as a Neural Substrate of Symptom Expression Across the General Population and Patients with Schizophrenia. Schizophr. Bull. 41, 1370–1378 (2015).
OpenUrl CrossRef PubMed
37.↵
Kirschner, M. et al. Ventral striatal hypoactivation is associated with apathy but not diminished expression in patients with schizophrenia. J. Psychiatry Neurosci. 41, 152–161 (2016).
OpenUrl
38.↵
Sladky, R. et al. Slice-timing effects and their correction in functional MRI. Neuroimage 58, 588–594 (2011).
OpenUrl CrossRef PubMed Web of Science
39.↵
Kirschner, M. et al. Self-regulation of the dopaminergic reward circuit in cocaine users with mental imagery and neurofeedback. EBioMedicine 37, 489–498 (2018).
OpenUrl
40.↵
Sladky, R. et al. High-resolution functional MRI of the human amygdala at 7 T. Eur. J. Radiol. 82, 728–733 (2013).
OpenUrl CrossRef PubMed
41.↵
Weissenbacher, A. et al. Correlations and anticorrelations in resting-state functional connectivity MRI: A quantitative comparison of preprocessing strategies. Neuroimage 47, 1408–1416 (2009).
OpenUrl CrossRef PubMed Web of Science
42.↵
Spunt, B. Spunt/Bspmview: Bspmview V.20161108. (2016). doi:10.5281/ZENODO.168074
OpenUrl CrossRef
43.↵
Niendam, T. A. et al. Meta-analytic evidence for a superordinate cognitive control network subserving diverse executive functions. Cogn. Affect. Behav. Neurosci. 12, 241–68 (2012).
OpenUrl CrossRef PubMed
44.↵
McLaren, D. G., Ries, M. L., Xu, G. & Johnson, S. C. A generalized form of context-dependent psychophysiological interactions (gPPI): A comparison to standard approaches. Neuroimage 61, 1277–1286 (2012).
OpenUrl CrossRef PubMed Web of Science
45.↵
Lehrl, S. Mehrfachwahl-Wortschatz-Intelligenztest MWT-B. (Spitta Verlag, Balingen, 2005).
46.↵
Krohne, H. W., Egloff, B., Kohlmann, C.-W. & Tausch, A. Untersuchung mit einer deutschen Form der Positive and Negative Affect Schedule (PANAS). Diagnostica 42, 139–156 (1996).
OpenUrl
47.↵
Preuss, U. W. et al. Psychometrische evaluation der deutschsprachigen version der Barratt-Impulsiveness-Skala. Nervenarzt 79, 305–319 (2008).
OpenUrl CrossRef PubMed
48.↵
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction. (A Bradford Book, 1998).
49.↵
Sulzer, J. et al. Real-time fMRI neurofeedback: Progress and challenges. Neuroimage 76, 386–399 (2013).
OpenUrl CrossRef PubMed
50.↵
Kirschner, M. et al. Self-regulation and real time fMRI neurofeedback of the dopaminergic reward system in cocaine users Background & Objectives Key findings. Proc. Annu. Meet. Soc. Biol. Psychiatry 8001 (2017).
51.↵
Greer, S. M., Trujillo, A. J., Glover, G. H. & Knutson, B. Control of nucleus accumbens activity with neurofeedback. Neuroimage 96, 237–244 (2014).
OpenUrl
52.↵
Parro, C., Dixon, M. L. & Christoff, K. The neural basis of motivational influences on cognitive control. Hum. Brain Mapp. 39, 5097–5111 (2018).
OpenUrl CrossRef PubMed
53.↵
Marco-Pallarés, J., Müller, S. V. & Münte, T. F. Learning by doing: An fMRI study of feedback-related brain activations. Neuroreport 18, 1423–1426 (2007).
OpenUrl CrossRef PubMed Web of Science
54.↵
Emmert, K. et al. Meta-analysis of real-time fMRI neurofeedback studies using individual participant data: How is brain regulation mediated? Neuroimage 124, 806–812 (2016).
OpenUrl CrossRef
55.↵
Tenison, C., Fincham, J. M. & Anderson, J. R. Phases of learning: How skill acquisition impacts cognitive processing. Cogn. Psychol. 87, 1–28 (2016).
OpenUrl CrossRef
56.↵
Friedman, N. P. & Miyake, A. Unity and diversity of executive functions: Individual differences as a window on cognitive structure. Cortex 86, 186–204 (2017).
OpenUrl CrossRef
57.
Braver, T. S., Cole, M. W. & Yarkoni, T. Vive les differences! Individual variation in neural mechanisms of executive control. Current Opinion in Neurobiology 20, 242–250 (2010).
OpenUrl CrossRef PubMed
58.
Buhle, J. T. et al. Cognitive reappraisal of emotion: A meta-analysis of human neuroimaging studies. Cereb. Cortex 24, 2981–2990 (2014).
OpenUrl CrossRef PubMed Web of Science
59.↵
Kohn, N. et al. Neural network of cognitive emotion regulation - An ALE meta-analysis and MACM analysis. Neuroimage 87, 345–355 (2014).
OpenUrl CrossRef PubMed Web of Science
60.↵
Hendricks, M. A. & Buchanan, T. W. Individual differences in cognitive control processes and their relationship to emotion regulation. Cogn. Emot. 30, 912–924 (2016).
OpenUrl
61.↵
Arnsten, A. F. T., Wang, M. & Paspalas, C. D. Dopamine’s Actions in Primate Prefrontal Cortex: Challenges for Treating Cognitive Disorders. Pharmacol. Rev. 67, 681–696 (2015).
OpenUrl Abstract/FREE Full Text
62.↵
Beck, A. T. The current state of cognitive therapy: A 40-year retrospective. Archives of General Psychiatry 62, 953–959 (2005).
OpenUrl CrossRef PubMed Web of Science
63.↵
Lynch, T. R., Trost, W. T., Salsman, N. & Linehan, M. M. Dialectical Behavior Therapy for Borderline Personality Disorder. Annu. Rev. Clin. Psychol. 3, 181–205 (2007).
OpenUrl CrossRef PubMed
64.↵
Bateman, A. & Fonagy, P. Mentalization based treatment for borderline personality disorder. World Psychiatry 9, 11–5 (2010).
OpenUrl CrossRef PubMed Web of Science
65.
Maroda, K. J. Psychodynamic Techniques: Working with Emotion in the Therapeutic Relationship. Journal of Phenomenological Psychology (Guilford Press, 2010). doi:10.1163/156916211x567505
OpenUrl CrossRef
66.↵
Have-de Labije, J. & Neborsky, R. Mastering intensive short-term dynamic psychotherapy: a roadmap to the unconscious. (Karnac Books, 2012).
67.↵
Young, K. D. et al. Real-Time Functional Magnetic Resonance Imaging Amygdala Neurofeedback Changes Positive Information Processing in Major Depressive Disorder. Biol. Psychiatry 82, 578–586 (2017).
OpenUrl
68.↵
Koob, G. F. & Volkow, N. D. Neurocircuitry of addiction. Neuropsychopharmacology 35, 217–38 (2010).
OpenUrl CrossRef PubMed Web of Science
69.
Holmes, A. J., Hollinshead, M. O., Roffman, J. L., Smoller, J. W. & Buckner, R. L. Individual Differences in Cognitive Control Circuit Anatomy Link Sensation Seeking, Impulsivity, and Substance Use. J. Neurosci. 36, 4038–4049 (2016).
OpenUrl Abstract/FREE Full Text
70.↵
George, O. & Koob, G. F. Individual differences in prefrontal cortex function and the transition from drug use to drug dependence. Neuroscience and Biobehavioral Reviews 35, 232–247 (2010).
OpenUrl CrossRef PubMed Web of Science
71.↵
Pearce, J. Animal Learning and Cognition: An Introduction. (2008).
72.↵
Ballard, I. C. et al. Dorsolateral Prefrontal Cortex Drives Mesolimbic Dopaminergic Regions to Initiate Motivated Behavior. J. Neurosci. 31, 10340–10346 (2011).
OpenUrl Abstract/FREE Full Text
73.↵
Jo, Y. S. & Mizumori, S. J. Prefrontal Regulation of Neuronal Activity in the Ventral Tegmental Area. Cereb. Cortex 26, 4057–4068 (2016).
OpenUrl CrossRef PubMed
74.↵
Schenk, L. A., Sprenger, C., Onat, S., Colloca, L. & Büchel, C. Suppression of Striatal Prediction Errors by the Prefrontal Cortex in Placebo Hypoalgesia. J. Neurosci. 37, 9715–9723 (2017).
OpenUrl Abstract/FREE Full Text
75.
Weber, S. C., Kahnt, T., Quednow, B. B. & Tobler, P. N. Frontostriatal pathways gate processing of behaviorally relevant reward dimensions. PLoS Biol. 16, e2005722 (2018).
OpenUrl
76.↵
Chatham, C. H., Frank, M. J. & Badre, D. Corticostriatal output gating during selection from working memory. Neuron 81, 930–942 (2014).
OpenUrl CrossRef PubMed
77.↵
Tik, M. et al. Ultra-high-field fMRI insights on insight: Neural correlates of the Aha!-moment. Hum. Brain Mapp. 39, 3241–3252 (2018).
OpenUrl
78.↵
Hahn, A. et al. Individual Diversity of Functional Brain Network Economy. Brain Connect. 5, 156–165 (2014).
OpenUrl
79.↵
Wark, B., Lundstrom, B. N. & Fairhall, A. Sensory adaptation. Curr. Opin. Neurobiol. 17, 423–9 (2007).
OpenUrl CrossRef PubMed Web of Science
80.↵
Kirschner, M. et al. Deficits in context-dependent adaptive coding in early psychosis and healthy individuals with schizotypal personality traits. Brain 141, 2806–2819 (2018).
OpenUrl
81.↵
Trutti, A. C., Mulder, M. J., Hommel, B. & Forstmann, B. U. Functional neuroanatomical review of the ventral tegmental area. NeuroImage 191, 258–268 (2019).
OpenUrl
82.↵
Düzel, E. et al. Functional imaging of the human dopaminergic midbrain. Trends Neurosci. 32, 321–328 (2009).
OpenUrl CrossRef PubMed Web of Science
83.↵
Hellrung, L. et al. Intermittent compared to continuous real-time fMRI neurofeedback boosts control over amygdala activation. Neuroimage 166, (2018).

View the discussion thread.

Posted March 13, 2020.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Neuroscience

Subject Areas

All Articles

Animal Behavior and Cognition (5200)
Biochemistry (11703)
Bioengineering (8718)
Bioinformatics (29127)
Biophysics (14930)
Cancer Biology (12048)
Cell Biology (17353)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14143)
Epidemiology (2067)
Evolutionary Biology (18266)
Genetics (12219)
Genomics (16765)
Immunology (11841)
Microbiology (28003)
Molecular Biology (11551)
Neuroscience (60804)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3229)
Physiology (4939)
Plant Biology (10383)
Scientific Communication and Education (1679)
Synthetic Biology (2877)
Systems Biology (7333)
Zoology (1642)

[1] 1.↵
Schultz, W. Predictive Reward Signal of Dopamine Neurons. J. Neurophysiol. 80, 1–27 (1998).
OpenUrl CrossRef PubMed Web of Science

[2] 2.↵
Schultz, W. Dopamine reward prediction error coding. Dialogues Clin. Neurosci. 18, 23–32 (2016).
OpenUrl CrossRef PubMed

[3] 3.
Burke, C. J. & Tobler, P. N. Time, Not Size, Matters for Striatal Reward Predictions to Dopamine. Neuron 91, 8–11 (2016).
OpenUrl

[4] 4.↵
Tobler, P. N., Fletcher, P. C., Bullmore, E. T. & Schultz, W. Learning-Related Human Brain Activations Reflecting Individual Finances. Neuron 54, 167–175 (2007).
OpenUrl CrossRef PubMed Web of Science

[5] 5.↵
Bromberg-Martin, E. S., Matsumoto, M. & Hikosaka, O. Dopamine in Motivational Control: Rewarding, Aversive, and Alerting. Neuron 68, 815–834 (2010).
OpenUrl CrossRef PubMed Web of Science

[6] 6.↵
Wise, R. A. Dopamine, learning and motivation. Nature Reviews Neuroscience 5, 483–494 (2004).
OpenUrl CrossRef PubMed Web of Science

[7] 7.↵
Friston, K. et al. The anatomy of choice: Dopamine and decision-making. Philos. Trans. R. Soc. B Biol. Sci. 369, (2014).

[8] 8.↵
Huys, Q. J. M., Tobler, P. N., Hasler, G. & Flagel, S. B. The role of learning-related dopamine signals in addiction vulnerability. in Progress in Brain Research 211, 31–77 (2014).
OpenUrl CrossRef PubMed

[9] 9.↵
Deserno, L., Schlagenhauf, F. & Heinz, A. Striatal dopamine, reward, and decision making in schizophrenia. Dialogues Clin. Neurosci. 18, 77–89 (2016).
OpenUrl

[10] 10.↵
Maia, T. V. & Frank, M. J. An Integrative Perspective on the Role of Dopamine in Schizophrenia. Biol. Psychiatry 81, 52–66 (2017).
OpenUrl

[11] 11.↵
Meder, D., Herz, D. M., Rowe, J. B., Lehéricy, S. & Siebner, H. R. The role of dopamine in the brain - lessons learned from Parkinson’s disease. Neuroimage 190, 79–93 (2019).
OpenUrl

[12] 12.↵
Sulzer, J. et al. Neurofeedback-mediated self-regulation of the dopaminergic midbrain. Neuroimage 75, 176–184 (2013).
OpenUrl

[13] 13.↵
MacInnes, J. J., Dickerson, K. C., Chen, N. kuei & Adcock, R. A. Cognitive Neurostimulation: Learning to Volitionally Sustain Ventral Tegmental Area Activation. Neuron 89, 1331–1342 (2016).
OpenUrl

[14] 14.↵
Kirschner, M. et al. Self-regulation of the Dopaminergic Reward Circuit in Cocaine Users with Mental Imagery and Neurofeedback. bioRxiv (2018).

[15] 15.↵
Klein, M. O. et al. Dopamine: Functions, Signaling, and Association with Neurological Diseases. Cell. Mol. Neurobiol. 39, 31–59 (2019).
OpenUrl CrossRef

[16] 16.↵
Alkoby, O., Abu-Rmileh, A., Shriki, O. & Todder, D. Can We Predict Who Will Respond to Neurofeedback? A Review of the Inefficacy Problem and Existing Predictors for Successful EEG Neurofeedback Learning. Neuroscience 378, 155–164 (2018).
OpenUrl

[17] 17.↵
Sitaram, R. et al. Closed-loop brain training: the science of neurofeedback. Nat. Rev. Neurosci. 18, 86–100 (2016).
OpenUrl CrossRef

[18] 18.↵
Wu, J. et al. Cortical control of VTA function and influence on nicotine reward. Biochemical Pharmacology 86, 1173–1180 (2013).
OpenUrl

[19] 19.↵
Frankle, W. G., Laruelle, M. & Haber, S. N. Prefrontal cortical projections to the midbrain in primates: Evidence for a sparse connection. Neuropsychopharmacology 31, 1627–1636 (2006).
OpenUrl CrossRef PubMed Web of Science

[20] 20.↵
Gao, M. et al. Functional Coupling between the Prefrontal Cortex and Dopamine Neurons in the Ventral Tegmental Area. J. Neurosci. 27, 5414–5421 (2007).
OpenUrl Abstract/FREE Full Text

[21] 21.↵
Sesack, S. R., Carr, D. B., Omelchenko, N. & Pinto, A. Anatomical Substrates for Glutamate-Dopamine Interactions: Evidence for Specificity of Connections and Extrasynaptic Actions. in Annals of the New York Academy of Sciences 1003, 36–52 (John Wiley & Sons, Ltd (10.1111), 2003).
OpenUrl CrossRef PubMed Web of Science

[22] 22.↵
Birbaumer, N., Ruiz, S. & Sitaram, R. Learned regulation of brain metabolism. Trends Cogn. Sci. 17, 295–302 (2013).
OpenUrl CrossRef PubMed

[23] 23.↵
Oblak, E. F., Lewis-Peacock, J. A. & Sulzer, J. S. Self-regulation strategy, feedback timing and hemodynamic properties modulate learning in a simulated fMRI neurofeedback environment. PLoS Comput. Biol. 13, e1005681 (2017).
OpenUrl

[24] 24.↵
Radua, J., Stoica, T., Scheinost, D., Pittenger, C. & Hampson, M. Neural Correlates of Success and Failure Signals During Neurofeedback Learning. Neuroscience 378, 11–21 (2018).
OpenUrl

[25] 25.↵
Ferenczi, E. A. et al. Prefrontal cortical regulation of brainwide circuit dynamics and reward-related behavior. Science (80-.). 351, aac9698–aac9698 (2016).
OpenUrl Abstract/FREE Full Text

[26] 26.↵
Tsai, H. C. et al. Phasic firing in dopaminergic neurons is sufficient for behavioral conditioning. Science (80.). 324, 1080–1084 (2009).
OpenUrl Abstract/FREE Full Text

[27] 27.↵
D’Ardenne, K., McClure, S. M., Nystrom, L. E. & Cohen, J. D. BOLD responses reflecting dopaminergic signals in the human ventral tegmental area. Science (80.). 319, 1264–1267 (2008).
OpenUrl Abstract/FREE Full Text

[28] 28.↵
Zaghloul, K. A. et al. Human substantia nigra neurons encode unexpected financial rewards. Science (80.). 323, 1496–1499 (2009).
OpenUrl Abstract/FREE Full Text

[29] 29.↵
Steinberg, E. E. et al. A causal link between prediction errors, dopamine neurons and learning. Nat. Neurosci. 16, 966–973 (2013).
OpenUrl CrossRef PubMed

[30] 30.↵
Keiflin, R., Pribut, H. J., Shah, N. B. & Janak, P. H. Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr. Biol. 29, 93–103.e3 (2019).
OpenUrl CrossRef

[31] 31.↵
Tobler, P. N., Fiorillo, C. D. & Schultz, W. Adaptive coding of reward value by dopamine neurons. Science 307, 1642–5 (2005).
OpenUrl Abstract/FREE Full Text

[32] 32.↵
Kirschner, M. et al. Deficits in context-dependent adaptive coding in early psychosis and healthy individuals with schizotypal personality traits. Brain 141, 2806–2819 (2018).
OpenUrl

[33] 33.↵
Miyapuram, K. P., Tobler, P. N., Gregorios-Pippas, L. & Schultz, W. BOLD responses in reward regions to hypothetical and imaginary monetary rewards. Neuroimage 59, 1692–1699 (2012).
OpenUrl CrossRef PubMed Web of Science

[34] 34.↵
Murty, V. P. et al. Resting state networks distinguish human ventral tegmental area from substantia nigra. Neuroimage 100, 580–589 (2014).
OpenUrl CrossRef PubMed

[35] 35.↵
Murty, V. P. et al. Resting state networks distinguish human ventral tegmental area from substantia nigra. Neuroimage 100, 580–9 (2014).
OpenUrl CrossRef PubMed

[36] 36.↵
Simon, J. J. et al. Reward System Dysfunction as a Neural Substrate of Symptom Expression Across the General Population and Patients with Schizophrenia. Schizophr. Bull. 41, 1370–1378 (2015).
OpenUrl CrossRef PubMed

[37] 37.↵
Kirschner, M. et al. Ventral striatal hypoactivation is associated with apathy but not diminished expression in patients with schizophrenia. J. Psychiatry Neurosci. 41, 152–161 (2016).
OpenUrl

[38] 38.↵
Sladky, R. et al. Slice-timing effects and their correction in functional MRI. Neuroimage 58, 588–594 (2011).
OpenUrl CrossRef PubMed Web of Science

[39] 39.↵
Kirschner, M. et al. Self-regulation of the dopaminergic reward circuit in cocaine users with mental imagery and neurofeedback. EBioMedicine 37, 489–498 (2018).
OpenUrl

[40] 40.↵
Sladky, R. et al. High-resolution functional MRI of the human amygdala at 7 T. Eur. J. Radiol. 82, 728–733 (2013).
OpenUrl CrossRef PubMed

[41] 41.↵
Weissenbacher, A. et al. Correlations and anticorrelations in resting-state functional connectivity MRI: A quantitative comparison of preprocessing strategies. Neuroimage 47, 1408–1416 (2009).
OpenUrl CrossRef PubMed Web of Science

[42] 42.↵
Spunt, B. Spunt/Bspmview: Bspmview V.20161108. (2016). doi:10.5281/ZENODO.168074
OpenUrl CrossRef

[43] 43.↵
Niendam, T. A. et al. Meta-analytic evidence for a superordinate cognitive control network subserving diverse executive functions. Cogn. Affect. Behav. Neurosci. 12, 241–68 (2012).
OpenUrl CrossRef PubMed

[44] 44.↵
McLaren, D. G., Ries, M. L., Xu, G. & Johnson, S. C. A generalized form of context-dependent psychophysiological interactions (gPPI): A comparison to standard approaches. Neuroimage 61, 1277–1286 (2012).
OpenUrl CrossRef PubMed Web of Science

[45] 45.↵
Lehrl, S. Mehrfachwahl-Wortschatz-Intelligenztest MWT-B. (Spitta Verlag, Balingen, 2005).

[46] 46.↵
Krohne, H. W., Egloff, B., Kohlmann, C.-W. & Tausch, A. Untersuchung mit einer deutschen Form der Positive and Negative Affect Schedule (PANAS). Diagnostica 42, 139–156 (1996).
OpenUrl

[47] 47.↵
Preuss, U. W. et al. Psychometrische evaluation der deutschsprachigen version der Barratt-Impulsiveness-Skala. Nervenarzt 79, 305–319 (2008).
OpenUrl CrossRef PubMed

[48] 48.↵
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction. (A Bradford Book, 1998).

[49] 49.↵
Sulzer, J. et al. Real-time fMRI neurofeedback: Progress and challenges. Neuroimage 76, 386–399 (2013).
OpenUrl CrossRef PubMed

[50] 50.↵
Kirschner, M. et al. Self-regulation and real time fMRI neurofeedback of the dopaminergic reward system in cocaine users Background & Objectives Key findings. Proc. Annu. Meet. Soc. Biol. Psychiatry 8001 (2017).

[51] 51.↵
Greer, S. M., Trujillo, A. J., Glover, G. H. & Knutson, B. Control of nucleus accumbens activity with neurofeedback. Neuroimage 96, 237–244 (2014).
OpenUrl

[52] 52.↵
Parro, C., Dixon, M. L. & Christoff, K. The neural basis of motivational influences on cognitive control. Hum. Brain Mapp. 39, 5097–5111 (2018).
OpenUrl CrossRef PubMed

[53] 53.↵
Marco-Pallarés, J., Müller, S. V. & Münte, T. F. Learning by doing: An fMRI study of feedback-related brain activations. Neuroreport 18, 1423–1426 (2007).
OpenUrl CrossRef PubMed Web of Science

[54] 54.↵
Emmert, K. et al. Meta-analysis of real-time fMRI neurofeedback studies using individual participant data: How is brain regulation mediated? Neuroimage 124, 806–812 (2016).
OpenUrl CrossRef

[55] 55.↵
Tenison, C., Fincham, J. M. & Anderson, J. R. Phases of learning: How skill acquisition impacts cognitive processing. Cogn. Psychol. 87, 1–28 (2016).
OpenUrl CrossRef

[56] 56.↵
Friedman, N. P. & Miyake, A. Unity and diversity of executive functions: Individual differences as a window on cognitive structure. Cortex 86, 186–204 (2017).
OpenUrl CrossRef

[57] 57.
Braver, T. S., Cole, M. W. & Yarkoni, T. Vive les differences! Individual variation in neural mechanisms of executive control. Current Opinion in Neurobiology 20, 242–250 (2010).
OpenUrl CrossRef PubMed

[58] 58.
Buhle, J. T. et al. Cognitive reappraisal of emotion: A meta-analysis of human neuroimaging studies. Cereb. Cortex 24, 2981–2990 (2014).
OpenUrl CrossRef PubMed Web of Science

[59] 59.↵
Kohn, N. et al. Neural network of cognitive emotion regulation - An ALE meta-analysis and MACM analysis. Neuroimage 87, 345–355 (2014).
OpenUrl CrossRef PubMed Web of Science

[60] 60.↵
Hendricks, M. A. & Buchanan, T. W. Individual differences in cognitive control processes and their relationship to emotion regulation. Cogn. Emot. 30, 912–924 (2016).
OpenUrl

[61] 61.↵
Arnsten, A. F. T., Wang, M. & Paspalas, C. D. Dopamine’s Actions in Primate Prefrontal Cortex: Challenges for Treating Cognitive Disorders. Pharmacol. Rev. 67, 681–696 (2015).
OpenUrl Abstract/FREE Full Text

[62] 62.↵
Beck, A. T. The current state of cognitive therapy: A 40-year retrospective. Archives of General Psychiatry 62, 953–959 (2005).
OpenUrl CrossRef PubMed Web of Science

[63] 63.↵
Lynch, T. R., Trost, W. T., Salsman, N. & Linehan, M. M. Dialectical Behavior Therapy for Borderline Personality Disorder. Annu. Rev. Clin. Psychol. 3, 181–205 (2007).
OpenUrl CrossRef PubMed

[64] 64.↵
Bateman, A. & Fonagy, P. Mentalization based treatment for borderline personality disorder. World Psychiatry 9, 11–5 (2010).
OpenUrl CrossRef PubMed Web of Science

[65] 65.
Maroda, K. J. Psychodynamic Techniques: Working with Emotion in the Therapeutic Relationship. Journal of Phenomenological Psychology (Guilford Press, 2010). doi:10.1163/156916211x567505
OpenUrl CrossRef

[66] 66.↵
Have-de Labije, J. & Neborsky, R. Mastering intensive short-term dynamic psychotherapy: a roadmap to the unconscious. (Karnac Books, 2012).

[67] 67.↵
Young, K. D. et al. Real-Time Functional Magnetic Resonance Imaging Amygdala Neurofeedback Changes Positive Information Processing in Major Depressive Disorder. Biol. Psychiatry 82, 578–586 (2017).
OpenUrl

[68] 68.↵
Koob, G. F. & Volkow, N. D. Neurocircuitry of addiction. Neuropsychopharmacology 35, 217–38 (2010).
OpenUrl CrossRef PubMed Web of Science

[69] 69.
Holmes, A. J., Hollinshead, M. O., Roffman, J. L., Smoller, J. W. & Buckner, R. L. Individual Differences in Cognitive Control Circuit Anatomy Link Sensation Seeking, Impulsivity, and Substance Use. J. Neurosci. 36, 4038–4049 (2016).
OpenUrl Abstract/FREE Full Text

[70] 70.↵
George, O. & Koob, G. F. Individual differences in prefrontal cortex function and the transition from drug use to drug dependence. Neuroscience and Biobehavioral Reviews 35, 232–247 (2010).
OpenUrl CrossRef PubMed Web of Science

[71] 71.↵
Pearce, J. Animal Learning and Cognition: An Introduction. (2008).

[72] 72.↵
Ballard, I. C. et al. Dorsolateral Prefrontal Cortex Drives Mesolimbic Dopaminergic Regions to Initiate Motivated Behavior. J. Neurosci. 31, 10340–10346 (2011).
OpenUrl Abstract/FREE Full Text

[73] 73.↵
Jo, Y. S. & Mizumori, S. J. Prefrontal Regulation of Neuronal Activity in the Ventral Tegmental Area. Cereb. Cortex 26, 4057–4068 (2016).
OpenUrl CrossRef PubMed

[74] 74.↵
Schenk, L. A., Sprenger, C., Onat, S., Colloca, L. & Büchel, C. Suppression of Striatal Prediction Errors by the Prefrontal Cortex in Placebo Hypoalgesia. J. Neurosci. 37, 9715–9723 (2017).
OpenUrl Abstract/FREE Full Text

[75] 75.
Weber, S. C., Kahnt, T., Quednow, B. B. & Tobler, P. N. Frontostriatal pathways gate processing of behaviorally relevant reward dimensions. PLoS Biol. 16, e2005722 (2018).
OpenUrl

[76] 76.↵
Chatham, C. H., Frank, M. J. & Badre, D. Corticostriatal output gating during selection from working memory. Neuron 81, 930–942 (2014).
OpenUrl CrossRef PubMed

[77] 77.↵
Tik, M. et al. Ultra-high-field fMRI insights on insight: Neural correlates of the Aha!-moment. Hum. Brain Mapp. 39, 3241–3252 (2018).
OpenUrl

[78] 78.↵
Hahn, A. et al. Individual Diversity of Functional Brain Network Economy. Brain Connect. 5, 156–165 (2014).
OpenUrl

[79] 79.↵
Wark, B., Lundstrom, B. N. & Fairhall, A. Sensory adaptation. Curr. Opin. Neurobiol. 17, 423–9 (2007).
OpenUrl CrossRef PubMed Web of Science

[80] 80.↵
Kirschner, M. et al. Deficits in context-dependent adaptive coding in early psychosis and healthy individuals with schizotypal personality traits. Brain 141, 2806–2819 (2018).
OpenUrl

[81] 81.↵
Trutti, A. C., Mulder, M. J., Hommel, B. & Forstmann, B. U. Functional neuroanatomical review of the ventral tegmental area. NeuroImage 191, 258–268 (2019).
OpenUrl

[82] 82.↵
Düzel, E. et al. Functional imaging of the human dopaminergic midbrain. Trends Neurosci. 32, 321–328 (2009).
OpenUrl CrossRef PubMed Web of Science

[83] 83.↵
Hellrung, L. et al. Intermittent compared to continuous real-time fMRI neurofeedback boosts control over amygdala activation. Neuroimage 166, (2018).