Bradley C. Love

The Psychology of Persuasion

Fri, 16 Jul 2021 00:00:00 +0000

When we are on the right side of an argument, most of us believe presenting the facts and supporting evidence should be enough to persuade others. Instead, we are baffled when friends and family continue to vote for policies that run counter to their interests or pour the milk before the tea. Presenting evidence is not enough for persuasion because people are motivated reasoners driven by their core values and community membership. Rather than weight the evidence in an unbiased fashion, people construct narratives or stories to understand themselves and the world.

Decision Making as Story Telling

Imagine a jury sitting on a murder trial. The jurors aren’t weighting the probabilities of all the possible scenarios taking all the evidence into account. Instead, they are considering whether the story told by the prosecutor or defence is more coherent and persuasive. Once they settle on a story, evidence is interpreted in light of that narrative. Oddly, a story might be more compelling by only focusing on the strongest points at the expense of mentioning all the supporting evidence.

In our personal lives, we also tell stories about ourselves. We aren’t going to be receptive to information that conflicts with our personal story, such as being told we are racist. We also understand our actions through story telling. For example, we come to like what we purchase in the supermarket rather than simply purchase what we like. After all, why would we buy and eat something that we didn’t like? When it becomes difficult to explain a choice, such as when confronted with an aisle full of different jams that all would do, we can lapse into inaction.

The Story Teller Matters

We are rarely persuaded by our enemies. Common ground and shared values are lubricants for persuasion. For example, someone denying climate change in the presence of overwhelming evidence may do so because of broader motivations, such as fearing increased government regulations and being forced to give up their car. Someone on the same “team” who shares these values and goals is best positioned to make the case for climate change, whereas an environmental campaigner who favours more socialist policies and rides a bike to work is likely to be discounted presenting the same evidence. A blowback effect could even occur where the climate change denier takes the environmentalist’s “lies” as further evidence for the hoax whose true aim is to dismantle their way of life. People tend to follow community norms.

Persuasion to Action

Persuading someone does not guarantee action. For example, many people support politicians but don’t vote. To translate beliefs into actions, people need specific plans and triggers. A potential voter would need to reserve time in their diary and arrange transport to the polling station. Action happens when the environment supports it. Indeed, the basic idea of Nudge is not to persuade per se, but to make it easier for people to make the “right” choice, such as when organ donation is the default option. Like persuasion, action is not all about education. Facts matter, but sadly not as much as we would like to believe.

A neuroscience-inspired approach to transfer learning

Thu, 22 Oct 2020 00:00:00 +0000

Inspired by the brain, we find a goal-directed attention approach to feature reuse bests a commonly used machine learning strategy (Luo et al., 2020). In particular, attentional modulation of mid-level features in deep convolutional neural networks is more effective than retraining the last layer to transfer to a new task.

Neuroscience and machine learning have been enjoying a virtuous cycle in which advances in one field spurs advances in the other. For example, deep convolutional neural networks (DCNNs) were motivated by the organisation of the visual cortex. In this blog, we highlight another success for neuroscience-inspired approaches, namely using goal-directed attention to repurpose an existing network for a new task.

Goal-directed attention in humans

When searching for one’s car keys, a sensible strategy is to prioritise small and metallic objects. Focusing on goal-directed features at the expense of irrelevant features can increase one’s chances of finding the target item. Instead of retraining one’s brain for this particular recognition task, people use goal-directed attention to modulate activity in their visual system.

Figure 2 from Luo et al., 2020: The absence of a strong top-down signal (left) to guide visual processing leads to uncertainty about what this confusing image depicts. In contrast, when there is an expectation that a dog is present (right) the visual system is reconfigured to be more sensitive and biased toward supporting information, which leads to successful recognition of the Dalmatian.

Conventional transfer learning in machine learning

In contrast, one popular method for transfer learning in machine learning is to remove the final layer of the DCNN and retrain it for the new task. Like the attentional approach, most aspects of the original network are preserved. For example, all the useful features previously learned could be reused for a task that prioritises finding one’s keys. To provide another example, a DCNN model pre-trained on ImageNet could be fine-tuned into a cats-vs-dogs detector using very little data.

An alternative approach: goal directed attention

Goal-directed attention and transfer learning approaches reuse existing features, but there is a critical difference. In the brain, goal-directed attention primarily operates at mid- to late-stages of the ventral visual stream. Our networks with goal-directed attention operate similarly. In contrast, transfer learning adjusts features at the very end of a DCNN. How does a neuroscience-inspired approach compare to the standard machine learning approach?

Here, we describe a study in which we incorporate goal-directed attention into the mid-level of a DCNN and use it as an alternative to the transfer learning approach. Results from three object recognition tasks favour the neuroscience-inspired approach both in terms of performance and ability to scale.

Incorporating goal-directed attention in DCNN

In cognitive neuroscience, goal-directed attention is a mechanism that emphasises or de-emphasises features based on their task relevance. This is often formalised as the stretching and contracting of psychological feature dimensions.

Figure 1 from Luo et al., 2020: Attention alters the importance of feature dimensions. Four kitchen objects vary on two feature dimensions: albedo and size. In this example, albedo is the attended dimension (hence stretched) whereas attention to size is tuned down (hence compressed). Consequently, the key becomes more similar to the silver toaster than to the chopping board or salt shaker.

To incorporate this principle into DCNN models, we introduce a goal-directed attention layer at the mid-level of a pre-trained DCNN that can direct its focus on a set of features based on their goal relevance.

Figure 4 from Luo et al., 2020: Integration of Attention Layer with VGG-16. The attention layer is constructed with the same shape as the output representation of the preceding layer but constrained such that a single filter value is used across all spatial locations. The attention operation is carried out as a Hadamard product between the pre-attention activations and attention weights. As the bottom panel shows, previously highly activated filter can be tuned down by a small attention weight (colour from dark to bright) whereas previously barely activated filter can become highly activated due to attention re-weighting (colour from bright to dark)

Attention beats convention

Models trained on ImageNet using either approach are tested on three object recognition tasks involving standard ImageNet images, blended images and natural adversarial images. Natural adversarial images exploit vulnerabilities in DCNNs such as colour and texture biases (Hendrycks et al., 2019).

Figure 3 from Luo et al., 2020: (Left) A standard image from ImageNet’s Tabby Cat category (Deng et al., 2009). (Middle) A blended image by alpha-blending an image of a cat and an image of a dog. (Right) A natural adversarial image of a dragonfly misclassified as banana by DenseNet-121 with high confidence (Hendrycks et al., 2019).

All three tests follow the same procedure involving both target and non-target images. For example, when testing a model dedicated to detecting Chihuahuas, an equal number of Chihuahua and non-Chihuahua images are used to tune the network. For each model, we assess performance using signal detection theory.

We found that the goal-directed attention approach generally outperformed (i.e., higher $d^\prime$) the widely used transfer learning approach in all three tasks.

One explanation is that even though the attention layer had fewer tunable parameters ($512$ vs. $4,096,000$ parameters) than the retraining approach, the cascading effects through subsequent network layers provided the needed flexibility to match the task goal. The results suggest that this neuroscience-inspired approach can enable the model to more effectively adapt to new tasks at a relatively low cost. Additionally, since each attention weight has a unique correspondence to the entire feature map from the preceding layer, this goal-directed mechanism can potentially be more interpretable than the fully connected weights.

Model-based fMRI giveth and taketh away

Mon, 18 Nov 2019 00:00:00 +0000

What’s better than fMRI or cognitive modelling? Of course, their combination in the form of model-based fMRI! Rather than evaluating simple contrasts based on the experimental design, such as where in the brain lights up more for houses vs. faces, model-based fMRI evaluates proposed cognitive processes and representations.

In this blog, we’ll first consider an example of how model-based fMRI reveals aspects of brain activity that would not easily be found by standard methods. Then, we’ll get to the main story and share a new finding with you from a large-scale neuroeconomics study called NARPS (Neuroimaging Analysis Replication and Prediction Study; bioRxiv preprint).

NARPS evaluated how the many possible ways one can analyse fMRI data can affect the conclusions researchers draw. Seventy different research labs, including our own, signed up to partake in this endeavour and were given a few months to independently complete the analyses. In NARPS, the researchers were in a sense the study participants — NARPS asked whether the analysis choices researchers make shapes basic scientific conclusions.

We went one step further and analysed the data two different ways ourselves. One analysis was fairly standard (what we thought most teams would do) whereas the other approach was model-based. As you will see below, many findings that were found in the traditional analysis were deemed spurious in the model-based analysis. Before jumping into this main story, we’ll consider an example in which model-based analysis allowed for a discovery that would not be possible with traditional methods. Model-based analysis giveth and taketh away!

Model-Based analysis giveth

Figure 3 from Davis et al. (2012): Illustrations of the model-based measures used to characterise (fMRI) BOLD response. Brain regions associated with the cognitive model's recognition strength measure are depicted in red, and brain regions associated with the category match (measured in terms of entropy) are depicted in cyan. The bottom panel represents the predicted shape of each model-based regressor for the two item-types over the course of the experiment. For the model-based measures, the predicted pattern for exception trials is given in red, and the predicted pattern for rule-following trials is given in green. Model-based analysis allows for two simultaneous occurring cognitive processes to be localised.

In learning studies, the time course of how representations are acquired and updated is critical. Model-based analyses, in which a cognitive model is fit to behavioural data, can be used to capture such changes across trials. In the figure above, a category learning model was fit to behaviour and then internal model measures of item recognition and category match were extracted and used to analyse the (fMRI) BOLD response in the hippocampus. In other studies of learning, model-based fMRI allowed processes in different trial phases (decision vs. feedback processing) to be isolated. The cognitive models made it possible to quantify these hypothesised mental operations that were not directly observable.

These examples focus on univariate relationships between the brain and model measure, but it is also possible to analyse patterns of activity, such as how internal representations in the model parallel patterns of activity across voxels in the brain. Of course, any model-based analysis is only as good as the model. The cognitive model should be supported by previous work, including evaluation in behavioural studies. Decoding methods can also be used to test which of a set of competing models is most consistent with the BOLD response.

Model-based analysis taketh away

The previous examples of model-based analysis revealed effects that would not otherwise be observable. Model-based analysis can also “remove” effects that are probably misleading (i.e., false alarms).

In the NARPS project, our team conducted a model-based analysis that yielded some results at odds with the standard approach. The difference we found is germane to the goal of NARPS – NARPS is interested in how the many possible data analysis pipelines for fMRI data affect our scientific conclusions. The primary goal of NARPS is to examine the variability of fMRI data analyses as carried out by different groups of researchers. Seventy different research teams signed up to partake in this endeavour and were given a few months to complete the analyses. We were one of those seventy independent teams that made NARPS possible.

Regarding the data itself, over a hundred participants engaged in a standard decision making task (like Tom et al., 2007 and De Martino et al., 2010), for more information see NARPS Data & Analysis or the bioRxiv preprint . While in the scanner they had to accept or reject gambles in the form of unbiased coin flips; each gamble could incur in either gains or losses. Participants were either in a group where the gambles were calibrated for loss aversion (equal indifference) or not (equal range).

To get at the variability in analysing fMRI data, teams performed whole-brain corrected analyses and submitted their binary (yes/no) decisions regarding nine hypotheses for specific contrasts related to previous work (Tom et al., 2007; De Martino et al., 2010; Canessa et al., 2013; Canessa et al., 2017). The hypotheses are presented in the following table along with four columns: our expected results before analyzing the data, our model-absent results (i.e., gains and losses only), our model-present results (i.e., gains, losses, and decision entropy — explained below), and prediction market results. After the submission deadline (end of February), prediction markets were organized for all hypotheses (similar to Camerer et al., 2016; Camerer et al., 2018; Dreber et al., 2015) for a little over a week at the beginning of May. The researcher prediction market closed with the values seen in the table (arbitrary token units), which were highly correlated with the fundamental values reported in the preprint.

	Expected	Gains & Losses only	Gains, Losses, & Decision Entropy	Researcher Prediction Markets
Parametric effect of gain
1. Positive effect in ventromedial prefrontal cortex (vmPFC) - for the equal indifference group	yes	yes	no	0.814
2. Positive effect in vmPFC - for the equal range group	yes	no	no	0.753
3. Positive effect in ventral striatum (VS) - for the equal indifference group	yes	yes	no	0.743
4. Positive effect in VS - for the equal range group	yes	yes	yes	0.789
Parametric effect of loss
5. Negative effect in vmPFC - for the equal indifference group	yes	yes	yes	0.952
6. Negative effect in vmPFC - for the equal range group	yes	yes	yes	0.805
7. Positive effect in amygdala - for the equal indifference group	-	no	no	0.073
8. Positive effect in amygdala - for the equal range group	-	no	no	0.274
9. Greater positive response to losses in amygdala for equal range condition vs. equal indifference condition.	-	no	no	0.188

Our expectations and model-absent analysis

Given that the previous literature shows strong effects of value in both VS and vmPFC, we suspected that the majority of teams would answer “yes” for hypotheses 1-6. As for the hypotheses related to effects in the amygdala (hypotheses 7-9), we were indifferent given the conflicting findings for this area (e.g., Tom et al., 2007 and De Martino et al., 2010). Our initial expectations were then updated based on directly regressing gain and loss values from the experimental design onto the blood-oxygen-level dependent (BOLD) signal. However, below we explain what we think is the better model after including an additional term (i.e., inverse decision entropy) that captures something akin to decision confidence present in this task which we estimated from behaviour.

Our model

For our reported results, first, we estimated parameters from a simple cognitive model fit to behaviour: a logistic regression with an intercept and separate terms for gains and losses, predicting either accept or reject gamble. From the model’s predictions $p_{\mathrm{accept}}$, we were also able to calculate the inverse decision entropy:

\[\mathrm{iDE} = p_{\mathrm{accept}} \times log_2(p_{\mathrm{accept}}) + p_\mathrm{reject} \times log_2(p_{\mathrm{reject}})\]

for each gamble (see figure below). (We use inverse decision entropy because it aligns with intuitive notions of decision confidence.) Second, the BOLD model consisted of an intercept, gains, losses, and inverse decision entropy, as well as an assortment of standard movement nuisance regressors (i.e., rotations, translations, and framewise displacement).

Behavioural model and task. Three equations describe the behavioural model in a) where subjective value (SV) is a weighted combination of gains and losses, $p_{\mathrm{accept}}$ is the probability of accepting a gamble, and inverse decision entropy (iDE) is the negative Shannon entropy of $p_{\mathrm{accept}}$ and its complement $p_{\mathrm{reject}}$. In b) $p_{\mathrm{accept}}$ is plotted against subjective value to show how high values of inverse decision entropy characterise the tails of the sigmoid and bottoms out for middle values of SV, where $p_{\mathrm{accept}} = 0.5$. In c) the 2x2 table shows four different trial types based on whether the trial presents low or high values for each of the variables of interest, SV and iDE. Also, portending a main result, each cell presents the percentage of voxels that shows these specific conjunctions of effects.

Our results

With respect to our model, which included inverse decision entropy as another term, we only found sufficient evidence to support hypotheses 4, 5 and 6. These results are surprising as the literature might lead one to expect that effects should be stronger for hypotheses 1-3. Instead, our model-based analysis’s inclusion of the entropy term led to these hypotheses not being supported. Had we not included inverse decision entropy in the model, we would have also answered affirmatively to hypotheses 1 and 3.

The contrast in results for the standard and model-based analysis demonstrates the importance of model-based fMRI analyses in interpreting results. Unlike the previous cases considered, where model-based analysis revealed effects that would not otherwise be found, here including appropriate terms (e.g., entropy) led to effects no longer being observed. Given that uncertainty is a critical factor in this task, we believe that including this cognitive construct into the analysis provides a more accurate view on the data.

Rather than model-based analysis simply being a more technical analysis in some senses, it should be be seen as more conceptually correct when the cognitive model used captures important aspects of participants’ mental states. Although the model-based analysis we presented was very simple, it successfully leveraged the behavioural data to better understand the imaging data.

Model-based analysis giveth: Part II

From one perspective, the model-based analyses, which included a measure of entropy for each gamble decision, rendered a number of value effects non-significant. This is likely a good thing as the standard parametric analysis did not take into account important cognitive processes related to confidence, such as the cognitive model’s entropy measure.

From another perspective, the model-based analyses revealed a bunch of qualitatively new findings related to entropy. The entropy side of the story appears bigger and more exciting than the value one. To learn more about entropy and its relation to value, you can check out the poster we presented at SfN: The neural link between subjective value and decision entropy, or better yet, our bioRxiv preprint. There we focus on the importance of decision entropy with respect to subjective value (as opposed to gains and losses separately, as we have reported here).

Finally, we would like to thank the participants in the study, all the members of the 70 participating labs, and the NARPS organisers. In addition to providing such a fine dataset, a formal assessment of variability in the fMRI processing pipeline was long overdue. Organising such projects is a lot of work, but it needs to be done. We hope this blog helps advance the aims of understanding how analysis choices affect scientific conclusions.

Fast food science is a shit sandwich

Fri, 22 Feb 2019 00:00:00 +0000

When it comes to technology and communication, faster is usually considered better. For example, test pilot Chuck Yeager showed “The Right Stuff” by being the first person to break the sound barrier. We celebrate computer chips becoming faster. In the age of the internet, many people view real-time interactive communication, such as on Twitter (more on this later!), as highly desirable. However, faster is not always better. Resisting the obvious and puerile joke, fast food is a clear example of faster not being best – It has its place but no one with any taste or class would argue that its the highest quality food, nor fit for occasions that are about more than convenience.

An embarrassment of riches, on Twitter.

Yet, somehow when it comes to science, where one would think reflection and deep thought would be prized, a lot of the community seems to have moved toward the fast food model of thought. The ethos is that instant commenting and evaluation is somehow expediting science. It’s not, much like how the public is not better informed by virtue of the 24/7 news cycle. The science case is actually more insidious than sound-bite journalism as the scientists themselves are the ones who shape the story in their own ahistorical echo chambers.

I experienced this recently when a prominent journal reviewer, who we believe majorly lost the plot by confusing our theory paper as a methods paper, posted his negative review without our consent as a blog post (copycats followed) shortly after we received the journal rejection (see here and here for a discussion). This podcast has also been recommended to me on the issue.

Within moments, we were obliged to respond to defend ourselves. I spent years working on a project and somehow found myself completing and posting a public blog response within an hour, absurd. Fortunately, we got it right and our revision cements that case, which is how it usually goes when one spends years thinking about a project and critics don’t have that luxury.

What struck me was there was no actual scientific discourse on Twitter and there couldn’t be under these conditions for the type of work we do. It was chaotic. It was bizarre. We were tone policed by a social psychologist who didn’t seem to understand (skin in the game?) the situation but sure had to pick a side, all while ignoring the existence of the early-career-researcher (ECR) lead author. We had no time to digest people’s points and chart a response. Scientific debate on Twitter is akin to politicians trying to score points in an American-style Presidential debate. We scored our points for sure, but it’s not a game that we are interested in playing.

Social media can be cesspool open to abuse that stands in stark contrast to open review models (that all involve consent) at journals like eLife and computer science conferences like ICLR. In our view, science doesn’t truly progress from takedowns and hit-and-runs, but from people thinking deeply about what they are doing, often in light of the feedback from others when it can be fully and deeply processed. Open review should not be someone’s internet graffiti.

My first experience with poorly executed "open review" from decades ago, as not currently practiced in computer science or in quality journals. Hate the game.

When we had time to dissect the attack blogs, which we did solely out of thoroughness as we did not find the points particularly relevant, we discovered that our main critic’s pet measure that supposedly is His gift to us wasn’t even really suited to our approach which relies on rank information. Still, others parrot the points of our critic’s musings absent thought, like that similarity and classifier functions are one-and-the-same because there exists examples of both that compute covariance information, which is the logical equivalent of concluding that two distinct species that both eat bugs under certain circumstances are one-and-the-same.

The point here is that, while it took us only moments to appreciate this critic missed our global point, it took weeks to appreciate that even the specifics were off the mark. Yet, people in moments were firing away opinions as facts (mostly by parroting one person’s views), lecturing us by tweet how science works, and telling us to sit back and enjoy it to make the most of it, which was all rather rapey and authoritarian. It would be bad enough if confined to Twitterverse, but this garbage thinking sticks and colours discourse, much like fake news does.

Twitter can exacerbate conflict through its dark triad of instant interaction and feedback, brief responses, and occasionally mob dynamics. Some seem to think they are owed a response because they questioned you or went to the trouble of writing a blog about you or (a blog about a blog) to the Kleene star. Here’s a hint, no one is obligated to respond to others’ hot takes. It is not a sign of strength and intellectual integrity to wade through the morass. Not everything dignifies a response, and even when something does it is not necessarily worth one’s time. Our preferred response is our revised preprint. Often, insta responding is a poor use of time and the people on the receiving end usually aren’t really processing what is said anyway. These rapid interactions favour narcissist bullshit merchants, who are exactly the folks you don’t want running a field. Dealing with them is effectively a Denial of Service (DoS) attack on actual thought, which is not expediting science. The experience can be consuming in the moment, but the half-life of thoughts on Twitter is brief.

In this quote from Shakespeare, violent means sudden. Yeah, sure it does.

Of course, media like Twitter can play positive roles in science, such as providing a means, albeit with a biased sample, to learn about recent work and people’s views and meet new people. I have learned a lot from people sharing information in direct messages (DMs) as well, and then there is the light-hearted banter. In contrast, real-time debate, especially when it’s a takedown on a particular person or paper, is unlikely to have valuable content.

It might be in science that you get to choose two from FAST, PROPERLY EXECUTED, INNOVATIVE. In my estimation, the last attribute is what is often lacking and what often goes under appreciated. I am advocating for giving scientists the opportunity to actually think. That’s why I got into this line of work. Sometimes it means sitting silently and thinking something through for a couple days and eventually getting it right after repeatedly doing so for months. In the open office plan of science, when people are expected to engage in instant debates that are formulated along the wrong dimensions, I just don’t see anything very deep or useful being produced. So, here’s for something better than cold fast food at the science banquet. Choose your dining partners carefully.

Sebastian's Thoughts on Open Review

Thu, 10 Jan 2019 00:00:00 +0000

My name is Sebastian Bobadilla-Suarez and I am an early career researcher (ECR — postdoc’ing in the Love Lab). I did my PhD with Brad Love at UCL as well. This post is about recent events regarding the review process of our manuscript titled Measures of neural similarity. Our manuscript was submitted to a prestigious journal and went through a formal review process. It was rejected by the reviewers, which is fine, but one of the reviewers decided to post his review on his own blog. This was problematic for several reasons, see here for Brad’s response. Sam Schwarzkopf also shared his take too.

Before I go on, I want to say that I am entirely in favor of open debate of ideas, open science and fully deconstructing manuscripts. I fully encourage this. However, open science should not be used to maintain the status quo but to challenge it. Also, I really appreciate the time and effort that goes into providing feedback on manuscripts, whether in a formal review process or not. Although we may not always agree with reviews on a manuscript, they are always welcomed as useful in one way or another to improve the work.

After reading some of the threads on this, I’d like to give my two cents as first author. I was surprised to see how polarized the subject of open science can become. A lot of the discourse from certain individuals seems hopelessly Manichaeistic (e.g., “I’m for open science, you’re not”). I am for open science, as I said above, but I am also for understanding how new and open science systems impact those lower in the scientific ranks. I would assume we are all pro open science as a default but still working on best practices, including practices pertaining to open review. To be clear, the motivation for this blog post is not to sidestep the points made in the original review. The goal here is to share my experience and perspective.

I feel the review process has been tainted for this project; a project that I hold close to my heart as one of my favorite initiated during my PhD. This obviously makes me biased, but then again who is going to stand up for my work if not me? I understand that uploading your manuscript on a preprint server invites informal comment and feedback, which is one of the reasons to do it in the first place, and as I said I fully welcome and appreciate any and all comments on my work. However, posting a formal review as a blog post necessarily carries more weight than any feedback provided outside the formal review process, especially when posted by one of the leaders in the field. I did feel wronged by how one of our reviewers has handled the process. The junior authors were not given the professional courtesy of notification and none of the authors opted into this way of handling reviews — we had no opportunity to reply before rejection. Ultimately, is accepting to review a manuscript with the goal of eventually blogging about it (as opposed to improving it) a conflict of interest? I am still developing my opinions with respect to best practices in open science.

I see that transparency can have pitfalls when the line between formal review and public debate has been blurred (especially at early stages on the rocky road to getting published). The fact that this line was blurred in a non-consensual fashion when best practices have not been cast into convention yet is unacceptable. Nonetheless, the fact that we in the open science community can discuss these cases, push back against dominant power structures, and set precedents will be beneficial moving forward.

I hope there are lessons to be learned here in general and that in my specific case future reviewers may place appropriate weight on the posted review in question — avoiding the use of such a post as a heuristic to base their own opinions on, since I personally think the review misses the point of my preprint. Is it now impossible to claim that future reviews are somehow independent of each other given that the blog post in question was the outcome of a formal review process? I think it is now a case of how to appropriately contextualize them. Maybe proponents of open science, and open review specifically, have thought of such situations and how privileged voices can drown out the voices of more junior people, like me. My biggest hope for open science is that it will create a fairer and more accessible system for the new generation of scientists. This is only possible by having these debates on new ways of doing things.

An Open Review of Niko Kriegeskorte

Wed, 09 Jan 2019 00:00:00 +0000

Imagine you think and work carefully on an ambitious paper for a few years, trying to answer fundamental questions the field has overlooked. Now, imagine after a very long wait you receive negative reviews that completely missed the main point. Instead, the reviews project a certain person’s pet concerns, goals, and interests onto your work, which are only tangentially related to your central questions. Worst yet, this person has acolytes whose capacity to miss the broader picture is only surpassed by their self-righteousness. That would be frustrating and potentially career-changing for some.

Now, let’s add to this scenario that this reviewer personally emails you moments after your rejection absent kind words (he got the memo that empathy is passé), but demands that you agree with his viewpoint and alerts you that he will post an open review of your work. This is inhuman.

But, open is good right? It can be, but it can also be incredibly self-serving. The review is of course open when Niko wants it to be and it serves him. Was the review posted before the editor made the reject decision? Of course not, because then we could respond and potentially affect the decision. Was it posted after we had time to publish elsewhere? Of course not. It was posted at the darkest time for my team when we are in our most vulnerable position when there’s little time to respond and we have more important things to worry about. Nevertheless, we are obliged to respond because Niko has poisoned the well for our project. “Open” here is not to serve the community or the authors but to provide a cheap blog and attention for Niko for the limited number of manuscripts that it serves Niko to review. The reinforcing power structure here should be apparent to anyone clued in.

Our paper itself, which I encourage you to read and form your own opinion, is about the nature of neural similarity, namely what makes two brain states similar. The main questions are whether the brain’s preferred notion of similarity is different across regions and tasks. We find that the preferred similarity measures are common across regions but differ across tasks. This is cool. Of course, as we discuss, whatever measure is “best” (whatever that means and it does mean different things to different people) will depend on many issues, including data quality and quantity. We muse a bit on how these higher-level measures of similarity relate to underlying computations and representations. There’s been a ton of work in Psychology on what makes two stimuli similar, but in Neuroscience people largely default to a few options without any real evaluation. Thus, our work is very needed in the field and timely. We get traction on this neglected problem by using a decoding approach to approximate the information available in a brain state. We discuss how much this approximation should be trusted in light of our central questions, namely does the brain use the same similarity measure across regions and tasks.

From Figure 1, Bobadilla-Suarez et al. (2018): Families of similarity measures. (left panel) Similarity measures divide into those concerned with angle vs. magnitude differences between vectors. Pearson correlation whereas Euclidean distance are common angle and magnitude measures, respectively. The magnitude family further subdivides according to distributional assumptions. Measures like Mahalanobis are distributional in that they are sensitive to co-variance such that similarity falls more rapidly along low variance directions. (right panel) The choice of similarity measure can strongly affect inferences about neural representational spaces. In this example, stimuli a, b, and c elicit different patterns of activity across two voxels. When Pearson correlation is used, stimulus a is more similar to b than to c. However, when the Euclidean measure is used, the pattern reverses such that stimulus a is more similar to c than b.

Decidedly what we are not trying to do is determine which neural similarity measure has the best properties by some metric, such as split-half reliability, bias, whatever small methodological point is of interest to some. Of course, that is what primarily interests some, such as Niko, but these points are minor and largely inconsequential to our goals and conclusions. Niko provided a top-down reading of our work strictly through the lens of his interests that fails to engage with the main ideas of the paper. I leave it to the acolytes to review his papers, of which I am familiar. Again, please read our paper, rather than parrot Niko’s views.

As these sideshows entertain, fundamental questions about how to bridge from neurons to voxels to compact higher-level descriptions to computations remain unanswered. To make progress, the field needs leaders who are open to ideas and are broader thinkers. Of course, instead we have a system that entrenches and amplifies those in positions of power within the field. Rather than fall in line, my lab is trying to address these difficult and subtle questions. However, how can we make progress in the field when we are reviewed by people like Niko who doesn’t believe the brain has representations?

true, the brain does not need representations. it also doesn't need information or causality. it's a dynamical system after all. it's *us* who need causality, and information theory, and representational interpretations to understand the brain. https://t.co/8JatZUo1yt
— Kriegeskorte Lab (@KriegeskorteLab) August 12, 2018

While we can all laugh at the occasional pseudo profound cringe-inducing tweets by celebrities like Elon Musk, we should expect more from the leaders of our field. It’s intolerable for our scientific fate to be controlled by someone who is a Cartesian Dualist or is profoundly confused by levels of analysis. I am glad that Niko and others picked up on ideas from Roger Shepard and others from 1970 on second-order isomorphism and that they popularised others’ efforts to apply related ideas to the analysis of fMRI data. They have made a career out of correlating the upper diagonal of matrices and plodding through attendant concerns. Now it’s time to allow others to make progress and introduce new ideas into the literature.

Postscript: Lots of discussion on Twitter. To be clear, we are not against the eLife model of publishing reviews upon acceptance, nor are we against leaving comments on preprints, which can allow the authors to respond and perhaps make edits. We are against using the existence of a preprint as a pretext to write journal reviews which are really self-serving blog posts, especially when they are posted the moment one’s paper is rejected by the editor. This take on open reviewing is open to abuse and is not really open as the reviewer decides what, where, when and how. Furthermore, existing models of open review involve consent from all parties.

Also see the post by the first author too, here: Sebastian’s Thoughts on Open Review.

How a CogSci undergrad invented PageRank three years before Google

Sun, 10 Dec 2017 00:00:00 +0000

Before Google, search engines, like AltaVista, often retrieved spurious web pages. Out of all the possible pages to return how does one determine which ones are the most relevant? One key to Google’s success was the PageRank algorithm developed by Google founders Sergey Brin and Larry Page in 1998. As they say, the rest is history, except there was a curious prehistory.

Three years prior in 1995, while an undergrad in Brown’s Cognitive and Linguistic Sciences program, I published an identical algorithm to PageRank, so I guess it would be more correct to say that Brin and Page published an algorithm identical to the Love and Sloman centrality algorithm. At the time, I was a Mathematics and Computer Science major that switched over to the Cognitive and Linguistic Sciences program because I wanted to understand which algorithms the human mind used to solve interesting problems. The story of my undergraduate honors thesis highlights how thinking about how the mind works can be useful for solving practical problems.

Returning to the centrality measure, the goal was to determine which parts of concepts were most central or important to people. The idea I had was that people view nodes in human concepts as more central to the extent that other nodes depend on them. For example, in the graph below of our concept of Robin (collected from human participants), Beak should be somewhat central because Eats depends on it. Like PageRank, indirect connections also influence centrality. For example, Eats depends on Beak and Living depends on eats, i.e., Living → Eats → Beak, which should have the effect of making Beak even more central to our conception of a Robin. To take into account all of these influences, the centrality algorithm iteratively computes how central a node is, taking into account its place in the overall dependency graph. With some mathematics background, I worked out that this iterative algorithm converges to the Eigen vector with the largest Eigen value in the dependency matrix (all the links can be represented as a matrix).

An example dependency (link) graph from Love and Sloman (1995).

PageRank is identical, but instead of working on a graph for a human concept it works on the links in the world wide web; simply replace concept node with webpage and dependency link with hyperlink. The goal of each algorithm is the same, to determine which nodes in a network are most central. Here is a good description of the math and ideas behind PageRank (i.e., the centrality algorithm) for those who want to know more.

One wonders whether other ideas are lying in the cognitive science dustbin awaiting rediscovery. The field itself is largely driven by fads and is prone to ignore genuine discoveries. That year at the Cognitive Science Society (CSS) conference my paper was well received but did not make a big splash. At the time, CSS folks were excited about connectionism and a paper on that topic won best student paper. Of course, that trend gave way to Bayesianism which has or will likely give way to deep learning. CSS tends to be fad-driven, which is one of the several reasons I resigned from the CSS last summer, but that is a topic for another blog post.

My undergraduate thesis is not a unique case of cognitive science research being relevant to machine learning research. The backpropagation algorithm, which is behind the past and current neural networks revolution, was developed by cognitive scientists. In addition, John R. Anderson independently discovered the Dirichlet process mixture for effective Bayesian clustering. And of course, the current excitement about the convolutional neural network architecture (trained with backpropgation) is motived by basic insights on how the human visual system is organized.

In these examples, establishing a connection to machine learning was possible because the cognitive science research was formal. Perhaps one lesson is that more students in cognitive science should seek training in formal methods. Another lesson is that computer scientists may be well served from some contact with cognitive science. Facetiously as much as seriously, a final lesson for any potential benefactors with deep pockets is to contact me because I have some more good ideas waiting on the shelf! 😊

Inclusive, Productive, Accountable

Thu, 06 Jul 2017 00:00:00 +0000

Slogans can prove hollow or can invite one to reflect on core values. Aiming for the latter, our new lab motto is Inclusive, Productive, Accountable (IPA). We aim for a community where everyone is hoppy, I mean happy, no matter their drink of choice. In seriousness, for the IPA motto to have a positive effect, what it concretely means needs to be clear.

Inclusivity

This amounts to not forming cliques that exclude others within lab (even inadvertently). The worst case scenario is creating virtual labs within the lab. Unfortunately, like high school students, this is what people in a workplace will naturally gravitate towards when they don’t take care.

What does this mean concretely? It means a lot, but here are some concrete examples for guidance:

Use the lab email list (not a personal email list) to advertise events so as to not exclude people.
Use the lab calendar to try to schedule these events when people are around where possible.
If you are going somewhere with a bunch of people from lab (e.g., a talk, drinks, lunch), wonder why you are not inviting everyone. I am not saying that people can’t have favourites in lab to spend time with, but one should wonder when half the lab is somewhere and there was no group invite. This is not to say that people are obligated to join group events, but everyone should always feel welcomed to do so and in the loop.
Don’t refer to anything as a lab event unless everyone in lab was invited with sufficient notice and there is some decent probability the majority of people would have an interest in attending.

That’s just a sample of what is a nebulous concept — welcoming and including everyone in lab such that there is one lab, not several factions.

Productivity

Knowing what the ultimate goal is and efficiently (in time and other resources) working toward it. This concept requires more unpacking, but notice productivity is more than “doing a lot of stuff” and working endless hours. Productivity requires alignment with lab goals and outputs.

Accountability

Taking responsibility for what falls within your realm; Being straightforward when you fall short (e.g., admit mistakes and seek to correct) as opposed to shifting blame; Doing what you say (agreed) you would do, which amounts to being trustworthy; Accepting consequences of one’s actions (or lack thereof).

Fear Humans, Not Artificial Intelligence

Sun, 15 May 2016 00:00:00 +0000

Are we on the cusp of creating super-intelligent machines? Would such a super-intelligence put humanity at existential risk? Certainly, leaders in academia and industry are convinced that the danger of our own creations turning on us is real. For example, Elon Musk, founder of Tesla Motors and SpaceX, has set up a billion dollar non-profit company with contributions from tech titans, such as Amazon, to prevent an evil Artificial Intelligence (AI) from bringing about the end of humanity. Universities, such as MIT, Oxford, and Cambridge, have established institutes to address the issue. Luminaries like Bill Joy, Bill Gates, and Stephen Hawking have all raised the alarm.

The end would appear nigh unless we act before it’s too late. Alternatively, perhaps science fiction and industry-fuelled hype have overcome better judgment. The cynic might say that this doomsday vision has taken on religious proportions. While previous generations dreamed of exploring the stars and interacting with alien species, the current technological and cultural zeitgeist is decidedly dystopian. This vision is buoyed in the tech world because it feeds egos — what conceit could be greater than believing one’s work could usher in such rapid innovation that history as we know it ends? No longer are tech figures cast as mere business leaders, but instead as gods who will determine the future of humanity and beyond. By the same token, frontline tech workers can reconceptualise their lives as involving something larger than properly pairing ads with cat videos using efficient algorithms. Who could blame them? For Judgment Day researchers, proclamations of an “existential threat” is not just a call to action, but a call to be funded generously and an opportunity to rub shoulders with the tech elite.

So, are smart machines more likely to kill us, save us, or simply drive us to work? To answer this question, it helps to step back and look at what is actually happening in AI. The basic technologies, such as those recently employed by Google’s DeepMind to defeat a human expert at the game Go, are simply refinements of technologies developed in the 1980s. There has been no qualitative breakthrough in approach. Instead, performance gains are attributable to larger training sets (also known as Big Data) and increased processing power. What is unchanged is that most machine systems work by maximising some kind of objective. In a game, the objective is simply to win, which is formally defined (e.g., capture the king in chess). This is one reason why games (checkers, chess, Go) are AI mainstays — it’s easy to specify the objective function.

In other cases, it may be harder to define the objective and this is where AI could go wrong. However, AI is more likely go wrong for reasons of incompetence rather than malice. For example, imagine that the US nuclear arsenal during the Cold War was under control of an AI to thwart sneak attack by the Soviet Union. Due to no action of the Soviet Union, a nuclear reactor meltdown occurs and the power grid temporarily collapses. The AI’s sensors detect the disruption and fallout, leading the system to infer an attack is underway. The President instructs the system in a shaky voice to stand down, but the AI takes the troubled voice as evidence the President is being coerced. Missiles released. End of humanity. The AI was simply following its programming, which led to a catastrophic error. This is exactly the kind of deadly mistakes that humans almost made during the Cold War. Our destruction would be attributable to our own incompetency rather than an evil AI turning on us, no different than an auto-pilot malfunctioning on a jumbo jet and sending its unfortunate passengers to their doom. In contrast, human pilots have purposefully killed their passengers, so perhaps we should welcome self-driving cars.

Of course, humans could design AIs to kill, but again this is people killing each other, not some self-aware machine deciding on this course of action. Western governments have already released computer viruses, such as Stuxnet, to target critical industrial infrastructure. Future viruses could be more clever and deadly. However, none of this is new and essentially follows the arc of history where humans use available technologies to kill one another. The AI would simply be following its programming and objective function.

Apart from people using AI to kill one another (rather than the AI deciding to do it itself), there are real dangers from AI, but these dangers are economic and social in nature. Clever AI will create tremendous wealth for society, but will leave many people without jobs. Unlike the industrial revolution, there may not be jobs for segments of society as machines may be better at every possible job. There will not be a flood of replacement “AI repair person” jobs to take up the slack. Already, high-tech companies with massive valuations, such as Facebook, employ relatively few people. For a historical perspective, it is also important to keep in mind that the Luddites were not irrational — they did lose their high-paying textile jobs to machines and it took two generations before their descendants’ wages reached what they earned.

There will be many losers as AI improves. Of course, these losses will be more than offset by gains in productivity, so the problem is essentially a political, social, and economic challenge on how to properly assist those (most of us?) who will be displaced by machines. Notice the danger here is not being killed by machines, but rather outcompeted by people employing machines. Even that is not really the danger — the true danger is that people will not look after one another as machines permanently displace entire classes of labor.

In summary, we should focus on the very real challenges to our survival, such as climate change, weapons of mass destruction, etc., not fanciful killer AI robots. For the foreseeable future, machines will remain a threat only when directed by humans by malice or through carelessness. The real challenges we face from AI are economic and social, but these are human problems that do not involve defending ourselves against sentient killer robots that have turned on us. If the machines ever come for us, they will likely be sent by other humans rather than deciding on their own accord. Taxi and delivery drivers will be made redundant by machines, not mowed down. In almost all cases, the dangers of AI are really about humans behaving badly.