Search | arXiv e-print repository

Generative Adversarial Collaborations: A practical guide for conference organizers and participating scientists

Authors: Gunnar Blohm, Benjamin Peters, Ralf Haefner, Leyla Isik, Nikolaus Kriegeskorte, Jennifer S. Lieberman, Carlos R. Ponce, Gemma Roig, Megan A. K. Peters

Abstract: Generative adversarial collaborations (GACs) are a form of formal teamwork between groups of scientists with diverging views. The goal of GACs is to identify and ultimately resolve the most important challenges, controversies, and exciting theoretical and empirical debates in a given research field. A GAC team would develop specific, agreed-upon avenues to resolve debates in order to move a field of research forward in a collaborative way. Such adversarial collaborations have many benefits and opportunities but also come with challenges. Here, we use our experience from (1) creating and running the GAC program for the Cognitive Computational Neuroscience (CCN) conference and (2) implementing and leading GACs on particular scientific problems to provide a practical guide for future GAC program organizers and leaders of individual GACs.

Submitted 19 February, 2024; originally announced February 2024.

arXiv:2401.06005 [pdf, other]

How does the primate brain combine generative and discriminative computations in vision?

Authors: Benjamin Peters, James J. DiCarlo, Todd Gureckis, Ralf Haefner, Leyla Isik, Joshua Tenenbaum, Talia Konkle, Thomas Naselaris, Kimberly Stachenfeld, Zenna Tavares, Doris Tsao, Ilker Yildirim, Nikolaus Kriegeskorte

Abstract: Vision is widely understood as an inference problem. However, two contrasting conceptions of the inference process have each been influential in research on biological vision as well as the engineering of machine vision. The first emphasizes bottom-up signal flow, describing vision as a largely feedforward, discriminative inference process that filters and transforms the visual information to remove irrelevant variation and represent behaviorally relevant information in a format suitable for downstream functions of cognition and behavioral control. In this conception, vision is driven by the sensory data, and perception is direct because the processing proceeds from the data to the latent variables of interest. The notion of "inference" in this conception is that of the engineering literature on neural networks, where feedforward convolutional neural networks processing images are said to perform inference. The alternative conception is that of vision as an inference process in Helmholtz's sense, where the sensory evidence is evaluated in the context of a generative model of the causal processes giving rise to it. In this conception, vision inverts a generative model through an interrogation of the evidence in a process often thought to involve top-down predictions of sensory data to evaluate the likelihood of alternative hypotheses.
The authors include scientists rooted in roughly equal numbers in each of the conceptions and motivated to overcome what might be a false dichotomy between them and engage the other perspective in the realm of theory and experiment. The primate brain employs an unknown algorithm that may combine the advantages of both conceptions. We explain and clarify the terminology, review the key empirical evidence, and propose an empirical research program that transcends the dichotomy and sets the stage for revealing the mysterious hybrid algorithm of primate vision.

Submitted 11 January, 2024; originally announced January 2024.

arXiv:2208.10668 [pdf, other]

doi 10.51628/001c.37507

Beyond linear regression: mapping models in cognitive neuroscience should align with research goals

Authors: Anna A. Ivanova, Martin Schrimpf, Stefano Anzellotti, Noga Zaslavsky, Evelina Fedorenko, Leyla Isik

Abstract: Many cognitive neuroscience studies use large feature sets to predict and interpret brain activity patterns. Feature sets take many forms, from human stimulus annotations to representations in deep neural networks. Of crucial importance in all these studies is the mapping model, which defines the space of possible relationships between features and neural data. Until recently, most encoding and decoding studies have used linear mapping models. Increasing availability of large datasets and computing resources has recently allowed some researchers to employ more flexible nonlinear mapping models instead; however, the question of whether nonlinear mapping models can yield meaningful scientific insights remains debated. Here, we discuss the choice of a mapping model in the context of three overarching desiderata: predictive accuracy, interpretability, and biological plausibility. We show that, contrary to popular intuition, these desiderata do not map cleanly onto the linear/nonlinear divide; instead, each desideratum can refer to multiple research goals, each of which imposes its own constraints on the mapping model. Moreover, we argue that, instead of categorically treating the mapping models as linear or nonlinear, we should instead aim to estimate the complexity of these models. We show that, in many cases, complexity provides a more accurate reflection of restrictions imposed by various research goals. Finally, we outline several complexity metrics that can be used to effectively evaluate mapping models.

Submitted 22 August, 2022; originally announced August 2022.

Comments: Accepted at Neurons, Brain, Data, and Theory

Journal ref: Neurons, Behavior, Data analysis, and Theory, 2022

arXiv:2201.07372 [pdf, other]

Prospective Learning: Principled Extrapolation to the Future

Authors: Ashwin De Silva, Rahul Ramesh, Lyle Ungar, Marshall Hussain Shuler, Noah J. Cowan, Michael Platt, Chen Li, Leyla Isik, Seung-Eon Roh, Adam Charles, Archana Venkataraman, Brian Caffo, Javier J. How, Justus M Kebschull, John W. Krakauer, Maxim Bichuch, Kaleab Alemayehu Kinfu, Eva Yezerets, Dinesh Jayaraman, Jong M. Shin, Soledad Villar, Ian Phillips, Carey E. Priebe, Thomas Hartung, Michael I. Miller, et al. (18 additional authors not shown)

Abstract: Learning is a process which can update decision rules, based on past experience, such that future performance improves. Traditionally, machine learning is often evaluated under the assumption that the future will be identical to the past in distribution or change adversarially. But these assumptions can be either too optimistic or pessimistic for many problems in the real world. Real world scenarios evolve over multiple spatiotemporal scales with partially predictable dynamics. Here we reformulate the learning problem to one that centers around this idea of dynamic futures that are partially learnable. We conjecture that certain sequences of tasks are not retrospectively learnable (in which the data distribution is fixed), but are prospectively learnable (in which distributions may be dynamic), suggesting that prospective learning is more difficult in kind than retrospective learning. We argue that prospective learning more accurately characterizes many real world problems that (1) currently stymie existing artificial intelligence solutions and/or (2) lack adequate explanations for how natural intelligences solve them. Thus, studying prospective learning will lead to deeper insights and solutions to currently vexing challenges in both natural and artificial intelligences.

Submitted 13 July, 2023; v1 submitted 18 January, 2022; originally announced January 2022.

Comments: Accepted at the 2nd Conference on Lifelong Learning Agents (CoLLAs), 2023

arXiv:2011.04245 [pdf, ps, other]

On the Index of Diffie-Hellman Mapping

Authors: Leyla Işık, Arne Winterhof

Abstract: Let $γ$ be a generator of a cyclic group $G$ of order $n$. The least index of a self-mapping $f$ of $G$ is the index of the largest subgroup $U$ of $G$ such that $f(x)x^{-r}$ is constant on each coset of $U$ for some positive integer~$r$. We determine the index of the univariate Diffie-Hellman mapping $d(γ^a)=γ^{a^2}$, $a=0,1,\ldots,n-1$, and show that any mapping of small index coincides with~$d$ only on a small subset of $G$. Moreover, we prove similar results for the bivariate Diffie-Hellman mapping $D(γ^a,γ^b)=γ^{ab}$, $a,b=0,1,\ldots,n-1$. In the special case that $G$ is a subgroup of the multiplicative group of a finite field we present improvements.

Submitted 9 November, 2020; originally announced November 2020.

arXiv:1703.09151 [pdf, ps, other]

Maximum-order Complexity and Correlation Measures

Authors: Leyla Işık, Arne Winterhof

Abstract: We estimate the maximum-order complexity of a binary sequence in terms of its correlation measures. Roughly speaking, we show that any sequence with small correlation measure up to a sufficiently large order $k$ cannot have very small maximum-order complexity.

Submitted 27 March, 2017; originally announced March 2017.

arXiv:1701.06158 [pdf, ps, other]

A Note on Value Sets of Polynomials over Finite Fields

Authors: Leyla Işık, Alev Topuzoğlu

Abstract: Most results on the value sets $V_f$ of polynomials $f \in \mathbb{F}_q[x]$ relate the cardinality $|V_f|$ to the degree of $f$. In particular, the structure of the spectrum of the class of polynomials of a fixed degree $d$ is rather well known. We consider a class $\mathcal{F}_{q,n}$ of polynomials, which we obtain by modifying linear permutations at $n$ points. The study of the spectrum of $\mathcal{F}_{q,n}$ enables us to obtain a simple description of polynomials $F \in \mathcal{F}_{q,n}$ with prescribed $V_F$, especially those avoiding a given set, like cosets of subgroups of the multiplicative group $\mathbb{F}_q^*$. The value set count for such $F$ can also be determined. This yields polynomials with evenly distributed values, which have small maximum count.

Submitted 22 January, 2017; originally announced January 2017.

arXiv:1611.06361 [pdf, ps, other]

Carlitz Rank and Index of Permutation Polynomials

Authors: Leyla Işık, Arne Winterhof

Abstract: Carlitz rank and index are two important measures for the complexity of a permutation polynomial $f(x)$ over the finite field $\mathbb{F}_q$. In particular, for cryptographic applications we need both, a high Carlitz rank and a high index. In this article we study the relationship between Carlitz rank $Crk(f)$ and index $Ind(f)$. More precisely, if the permutation polynomial is neither close to a polynomial of the form $ax$ nor a rational function of the form $ax^{-1}$, then we show that $Crk(f)>q- \max\{3 Ind(f),(3q)^{1/2}\}$. Moreover we show that the permutation polynomial which represents the discrete logarithm guarantees both a large index and a large Carlitz rank.

Submitted 19 November, 2016; originally announced November 2016.

MSC Class: 11T06; 11T24; 11T41; 11T71

arXiv:1606.04698 [pdf]

doi 10.1371/journal.pcbi.1005859

Invariant recognition drives neural representations of action sequences

Authors: Andrea Tacchetti, Leyla Isik, Tomaso Poggio

Abstract: Recognizing the actions of others from visual stimuli is a crucial aspect of human visual perception that allows individuals to respond to social cues. Humans are able to identify similar behaviors and discriminate between distinct actions despite transformations, like changes in viewpoint or actor, that substantially alter the visual appearance of a scene. This ability to generalize across complex transformations is a hallmark of human visual intelligence. Advances in understanding motion perception at the neural level have not always translated in precise accounts of the computational principles underlying what representation our visual cortex evolved or learned to compute. Here we test the hypothesis that invariant action discrimination might fill this gap. Recently, the study of artificial systems for static object perception has produced models, CNNs, that achieve human level performance in complex discriminative tasks. Within this class of models, architectures that better support invariant object recognition also produce image representations that match those implied by human and primate neural data. However, whether these models produce representations of action sequences that support recognition across complex transformations and closely follow neural representations remains unknown.
Here we show that spatiotemporal CNNs appropriately categorize video stimuli into actions, and that deliberate model modifications that improve performance on an invariant action recognition task lead to data representations that better match human neural recordings. Our results support our hypothesis that performance on invariant discrimination dictates the neural representations of actions computed by human visual cortex.

Submitted 20 April, 2017; v1 submitted 15 June, 2016; originally announced June 2016.

arXiv:1604.07710 [pdf, ps, other]

Complete mappings and Carlitz rank

Authors: Leyla Işık, Alev Topuzoğlu, Arne Winterhof

Abstract: The well-known Chowla and Zassenhaus conjecture, proven by Cohen in 1990, states that for any $d\ge 2$ and any prime $p>(d^2-3d+4)^2$ there is no complete mapping polynomial in $\mathbb{F}_{p}[x]$ of degree $d$. For arbitrary finite fields $\mathbb{F}_{q}$, we give a similar result in terms of the Carlitz rank of a permutation polynomial rather than its degree. We prove that if $n<\lfloor q/2\rfloor$, then there is no complete mapping in $\mathbb{F}_{q}[x]$ of Carlitz rank $n$ of small linearity. We also determine how far permutation polynomials $f$ of Carlitz rank $n<\lfloor q/2\rfloor$ are from being complete, by studying value sets of $f+x.$ We provide examples of complete mappings if $n=\lfloor q/2\rfloor$, which shows that the above bound cannot be improved in general.

Submitted 26 April, 2016; originally announced April 2016.

arXiv:1601.01358 [pdf]

Fast, invariant representation for human action in the visual system

Authors: Leyla Isik, Andrea Tacchetti, Tomaso Poggio

Abstract: Humans can effortlessly recognize others' actions in the presence of complex transformations, such as changes in viewpoint. Several studies have located the regions in the brain involved in invariant action recognition, however, the underlying neural computations remain poorly understood. We use magnetoencephalography (MEG) decoding and a dataset of well-controlled, naturalistic videos of five actions (run, walk, jump, eat, drink) performed by different actors at different viewpoints to study the computational steps used to recognize actions across complex transformations. In particular, we ask when the brain discounts changes in 3D viewpoint relative to when it initially discriminates between actions. We measure the latency difference between invariant and non-invariant action decoding when subjects view full videos as well as form-depleted and motion-depleted stimuli. Our results show no difference in decoding latency or temporal profile between invariant and non-invariant action recognition in full videos. However, when either form or motion information is removed from the stimulus set, we observe a decrease and delay in invariant action decoding. Our results suggest that the brain recognizes actions and builds invariance to complex transformations at the same time, and that both form and motion information are crucial for fast, invariant action recognition.

Submitted 15 August, 2017; v1 submitted 6 January, 2016; originally announced January 2016.

arXiv:1406.1770 [pdf, other]

Computational role of eccentricity dependent cortical magnification

Authors: Tomaso Poggio, Jim Mutch, Leyla Isik

Abstract: We develop a sampling extension of M-theory focused on invariance to scale and translation. Quite surprisingly, the theory predicts an architecture of early vision with increasing receptive field sizes and a high resolution fovea -- in agreement with data about the cortical magnification factor, V1 and the retina. From the slope of the inverse of the magnification factor, M-theory predicts a cortical "fovea" in V1 in the order of $40$ by $40$ basic units at each receptive field size -- corresponding to a foveola of size around $26$ minutes of arc at the highest resolution, $\approx 6$ degrees at the lowest resolution. It also predicts uniform scale invariance over a fixed range of scales independently of eccentricity, while translation invariance should depend linearly on spatial frequency. Bouma's law of crowding follows in the theory as an effect of cortical area-by-cortical area pooling; the Bouma constant is the value expected if the signature responsible for recognition in the crowding experiments originates in V2. From a broader perspective, the emerging picture suggests that visual recognition under natural conditions takes place by composing information from a set of fixations, with each fixation providing recognition from a space-scale image fragment -- that is an image patch represented at a set of increasing sizes and decreasing resolutions.

Submitted 6 June, 2014; originally announced June 2014.

Report number: CBMM memo 17

Showing 1–12 of 12 results for author: Isik, L