-
Location-based Radiology Report-Guided Semi-supervised Learning for Prostate Cancer Detection
Authors:
Alex Chen,
Nathan Lay,
Stephanie Harmon,
Kutsev Ozyoruk,
Enis Yilmaz,
Brad J. Wood,
Peter A. Pinto,
Peter L. Choyke,
Baris Turkbey
Abstract:
Prostate cancer is one of the most prevalent malignancies in the world. While deep learning has potential to further improve computer-aided prostate cancer detection on MRI, its efficacy hinges on the exhaustive curation of manually annotated images. We propose a novel methodology of semisupervised learning (SSL) guided by automatically extracted clinical information, specifically the lesion locations in radiology reports, allowing for use of unannotated images to reduce the annotation burden. By leveraging lesion locations, we refined pseudo labels, which were then used to train our location-based SSL model. We show that our SSL method can improve prostate lesion detection by utilizing unannotated images, with more substantial impacts being observed when larger proportions of unannotated images are used.
Submitted 17 June, 2024;
originally announced June 2024.
-
3D Scene Geometry Estimation from 360$^\circ$ Imagery: A Survey
Authors:
Thiago Lopes Trugillo da Silveira,
Paulo Gamarra Lessa Pinto,
Jeffri Erwin Murrugarra Llerena,
Claudio Rosito Jung
Abstract:
This paper provides a comprehensive survey on pioneer and state-of-the-art 3D scene geometry estimation methodologies based on single, two, or multiple images captured under the omnidirectional optics. We first revisit the basic concepts of the spherical camera model, and review the most common acquisition technologies and representation formats suitable for omnidirectional (also called 360$^\circ$, spherical or panoramic) images and videos. We then survey monocular layout and depth inference approaches, highlighting the recent advances in learning-based solutions suited for spherical data. The classical stereo matching is then revised on the spherical domain, where methodologies for detecting and describing sparse and dense features become crucial. The stereo matching concepts are then extrapolated for multiple view camera setups, categorizing them among light fields, multi-view stereo, and structure from motion (or visual simultaneous localization and mapping). We also compile and discuss commonly adopted datasets and figures of merit indicated for each purpose and list recent results for completeness. We conclude this paper by pointing out current and future trends.
Submitted 17 January, 2024;
originally announced January 2024.
-
Using Zero-shot Prompting in the Automatic Creation and Expansion of Topic Taxonomies for Tagging Retail Banking Transactions
Authors:
Daniel de S. Moraes,
Pedro T. C. Santos,
Polyana B. da Costa,
Matheus A. S. Pinto,
Ivan de J. P. Pinto,
Álvaro M. G. da Veiga,
Sergio Colcher,
Antonio J. G. Busson,
Rafael H. Rocha,
Rennan Gaio,
Rafael Miceli,
Gabriela Tourinho,
Marcos Rabaioli,
Leandro Santos,
Fellipe Marques,
David Favaro
Abstract:
This work presents an unsupervised method for automatically constructing and expanding topic taxonomies using instruction-based fine-tuned LLMs (Large Language Models). We apply topic modeling and keyword extraction techniques to create initial topic taxonomies and LLMs to post-process the resulting terms and create a hierarchy. To expand an existing taxonomy with new terms, we use zero-shot prompting to find out where to add new nodes, which, to our knowledge, is the first work to present such an approach to taxonomy tasks. We use the resulting taxonomies to assign tags that characterize merchants from a retail bank dataset. To evaluate our work, we asked 12 volunteers to answer a two-part form in which we first assessed the quality of the taxonomies created and then the tags assigned to merchants based on that taxonomy. The evaluation revealed a coherence rate exceeding 90% for the chosen taxonomies. The taxonomies' expansion with LLMs also showed exciting results for parent node prediction, with an f1-score above 70% in our taxonomies.
Submitted 11 February, 2024; v1 submitted 7 January, 2024;
originally announced January 2024.
-
A Review on Cryptocurrency Transaction Methods for Money Laundering
Authors:
Hugo Almeida,
Pedro Pinto,
Ana Fernández Vilas
Abstract:
Cryptocurrencies are considered relevant assets and they are currently used as an investment or to carry out transactions. However, specific characteristics commonly associated with the cryptocurrencies such as irreversibility, immutability, decentralized architecture, absence of control authority, mobility, and pseudo-anonymity make them appealing for money laundering activities. Thus, the collection and characterization of current cryptocurrency-based methods used for money laundering are paramount to understanding the circulation flows of physical and digital money and preventing this illegal activity. In this paper, a collection of cryptocurrency transaction methods is presented and distributed through the money laundering life cycle. Each method is analyzed and classified according to the phase of money laundering it corresponds to. The result of this article may in the future help design efficient strategies to prevent illegal money laundering activities.
Submitted 28 November, 2023;
originally announced November 2023.
-
Fejér monotone sequences revisited
Authors:
Ulrich Kohlenbach,
Pedro Pinto
Abstract:
In this paper we introduce a localized and relativized generalization of the usual concept of Fejér monotonicity together with uniform and quantitative versions thereof and show that the main quantitative results obtained by the 1st author together with Nicolae and Leuştean in 2018 and with López-Acedo and Nicolae in 2019 respectively, extend to this generalization. Our framework, in particular, covers the sequence generated by the Dykstra algorithm while the latter is not Fejér-monotone in the ordinary sense. This gives a theoretical explanation why under a metric regularity assumption one obtains an explicit rate of convergence for Dykstra's algorithm which was proved recently by the 2nd author.
Submitted 10 October, 2023;
originally announced October 2023.
-
Inverse design of self-folding 3D shells
Authors:
Diogo E. P. Pinto,
Nuno A. M. Araújo,
Petr Šulc,
John Russo
Abstract:
Inverse design aims at the development of elementary building blocks that organize spontaneously into target shapes. In self-assembly, the blocks diffuse to their target position. Alternatively, recent experiments point to a more robust process in which the shape is formed from the self-folding of a planar template. To control the folding of templates with competing folded structures, we propose the inclusion of bond specificity. We consider a template that can fold into an octahedron or a boat shell and find the minimal design capable of targeting either shell or switching between the two through an external stimulus, adding a new dimension to the design of shape-changing materials.
Submitted 5 October, 2023;
originally announced October 2023.
-
Generalized Information Criteria for Structured Sparse Models
Authors:
Eduardo F. Mendes,
Gabriel J. P. Pinto
Abstract:
Regularized m-estimators are widely used due to their ability of recovering a low-dimensional model in high-dimensional scenarios. Some recent efforts on this subject focused on creating a unified framework for establishing oracle bounds, and deriving conditions for support recovery. Under this same framework, we propose a new Generalized Information Criteria (GIC) that takes into consideration the sparsity pattern one wishes to recover. We obtain non-asymptotic model selection bounds and sufficient conditions for model selection consistency of the GIC. Furthermore, we show that the GIC can also be used for selecting the regularization parameter within a regularized $m$-estimation framework, which allows practical use of the GIC for model selection in high-dimensional scenarios. We provide examples of group LASSO in the context of generalized linear regression and low rank matrix regression.
Submitted 4 September, 2023;
originally announced September 2023.
-
On the finitary content of Dykstra's cyclic projections algorithm
Authors:
Pedro Pinto
Abstract:
We study the asymptotic behaviour of the well-known Dykstra's algorithm through the lens of proof-theoretical techniques. We provide an elementary proof for the convergence of Dykstra's algorithm in which the standard argument is stripped to its central features and where the original compactness principles are circumvented, additionally providing highly uniform primitive recursive rates of metastability in a full general setting. Moreover, under an additional assumption, we are even able to obtain effective general rates of convergence. We argue that such additional condition is actually necessary for the existence of general uniform rates of convergence.
Submitted 16 June, 2023;
originally announced June 2023.
-
Design strategies for the self-assembly of polyhedral shells
Authors:
Diogo E. P. Pinto,
Petr Sulc,
Francesco Sciortino,
John Russo
Abstract:
The control over the self-assembly of complex structures is a long-standing challenge of material science, especially at the colloidal scale, as the desired assembly pathway is often kinetically derailed by the formation of amorphous aggregates. Here we investigate in detail the problem of the self-assembly of the three Archimedean shells with five contact points per vertex, i.e. the icosahedron, the snub cube, and the snub dodecahedron. We use patchy particles with five interaction sites (or patches) as model for the building blocks, and recast the assembly problem as a Boolean satisfiability problem (SAT) for the patch-patch interactions. This allows us to find effective designs for all targets, and to selectively suppress unwanted structures. By tuning the geometrical arrangement and the specific interactions of the patches, we demonstrate that lowering the symmetry of the building blocks reduces the number of competing structures, which in turn can considerably increase the yield of the target structure. These results cement SAT-assembly as an invaluable tool to solve inverse design problems.
Submitted 13 April, 2023;
originally announced April 2023.
-
Two-step nucleation in a binary mixture of Patchy Particles
Authors:
Camilla Beneduce,
Diogo E. P. Pinto,
Petr Sulc,
Francesco Sciortino,
John Russo
Abstract:
Nucleation in systems with a metastable liquid-gas critical point is the prototypical example of a two-step nucleation process, in which the appearance of the critical nucleus is preceded by the formation of a liquid-like density fluctuation. So far, the majority of studies on colloidal and protein crystallization have focused on one-component systems, and we are lacking a clear description of two-step nucleation processes in multicomponent systems, where critical fluctuations involve coupled density and concentrations inhomogeneities. Here, we examine the nucleation process of a binary mixture of patchy particles designed to nucleate into a diamond lattice. By combining Gibbs-ensemble simulations and direct nucleation simulations over a wide range of thermodynamic conditions, we are able to pin down the role of the liquid-gas metastable phase diagram on the nucleation process. In particular, we show that the strongest enhancement of crystallization occurs at an azeotropic point with the same stoichiometric composition of the crystal.
Submitted 6 April, 2023;
originally announced April 2023.
-
GPT-4 Technical Report
Authors:
OpenAI,
Josh Achiam,
Steven Adler,
Sandhini Agarwal,
Lama Ahmad,
Ilge Akkaya,
Florencia Leoni Aleman,
Diogo Almeida,
Janko Altenschmidt,
Sam Altman,
Shyamal Anadkat,
Red Avila,
Igor Babuschkin,
Suchir Balaji,
Valerie Balcom,
Paul Baltescu,
Haiming Bao,
Mohammad Bavarian,
Jeff Belgum,
Irwan Bello,
Jake Berdine,
Gabriel Bernadett-Shapiro,
Christopher Berner,
Lenny Bogdonoff,
Oleg Boiko
, et al. (256 additional authors not shown)
Abstract:
We report the development of GPT-4, a large-scale, multimodal model which can accept image and text inputs and produce text outputs. While less capable than humans in many real-world scenarios, GPT-4 exhibits human-level performance on various professional and academic benchmarks, including passing a simulated bar exam with a score around the top 10% of test takers. GPT-4 is a Transformer-based model pre-trained to predict the next token in a document. The post-training alignment process results in improved performance on measures of factuality and adherence to desired behavior. A core component of this project was developing infrastructure and optimization methods that behave predictably across a wide range of scales. This allowed us to accurately predict some aspects of GPT-4's performance based on models trained with no more than 1/1,000th the compute of GPT-4.
Submitted 4 March, 2024; v1 submitted 15 March, 2023;
originally announced March 2023.
-
On computational properties of Cauchy problems generated by accretive operators
Authors:
Pedro Pinto,
Nicholas Pischke
Abstract:
In this paper, we provide quantitative versions of results on the asymptotic behavior of nonlinear semigroups generated by an accretive operator due to O. Nevanlinna and S. Reich as well as H.-K. Xu. These results themselves rely on a particular assumption on the underlying operator introduced by A. Pazy under the name of `convergence condition'. Based on logical techniques from `proof mining', a subdiscipline of mathematical logic, we derive various notions of a `convergence condition with modulus' which provide quantitative information on this condition in different ways. These techniques then also facilitate the extraction of quantitative information on the convergence results of Nevanlinna and Reich as well as Xu, in particular also in the form of rates of convergence which depend on these moduli for the convergence condition.
Submitted 11 October, 2023; v1 submitted 17 January, 2023;
originally announced January 2023.
-
Worldwide AI Ethics: a review of 200 guidelines and recommendations for AI governance
Authors:
Nicholas Kluge Corrêa,
Camila Galvão,
James William Santos,
Carolina Del Pino,
Edson Pontes Pinto,
Camila Barbosa,
Diogo Massmann,
Rodrigo Mambrini,
Luiza Galvão,
Edmund Terem,
Nythamar de Oliveira
Abstract:
The utilization of artificial intelligence (AI) applications has experienced tremendous growth in recent years, bringing forth numerous benefits and conveniences. However, this expansion has also provoked ethical concerns, such as privacy breaches, algorithmic discrimination, security and reliability issues, transparency, and other unintended consequences. To determine whether a global consensus exists regarding the ethical principles that should govern AI applications and to contribute to the formation of future regulations, this paper conducts a meta-analysis of 200 governance policies and ethical guidelines for AI usage published by public bodies, academic institutions, private companies, and civil society organizations worldwide. We identified at least 17 resonating principles prevalent in the policies and guidelines of our dataset, released as an open-source database and tool. We present the limitations of performing a global scale analysis study paired with a critical analysis of our findings, presenting areas of consensus that should be incorporated into future regulatory efforts. All components tied to this work can be found in https://nkluge-correa.github.io/worldwide_AI-ethics/
Submitted 19 February, 2024; v1 submitted 23 June, 2022;
originally announced June 2022.
-
Unifying interval maps and branching systems with applications to relative graph C*-algebras
Authors:
Carlos Correia Ramos,
Daniel Gonçalves,
Nuno Martins,
Paulo R. Pinto
Abstract:
We describe Markov interval maps via branching systems and develop the theory of relative branching systems, characterizing when the associated representations of relative graph C*-algebras are faithful. When the Markov interval maps $f$ have escape sets, we use our results to characterize injectivity of the associated relative graph algebra representations, improving on previous work by the first, third, and fourth authors.
Submitted 17 June, 2022;
originally announced June 2022.
-
Rates of asymptotic regularity for the alternating Halpern-Mann iteration
Authors:
Laurentiu Leustean,
Pedro Pinto
Abstract:
In this paper we extend to $UCW$-hyperbolic spaces the quantitative asymptotic regularity results for the alternating Halpern-Mann iteration obtained by Dinis and the second author for CAT(0) spaces. These results are new even for uniformly convex normed spaces. Furthermore, for a particular choice of the parameter sequences, we compute linear rates of asymptotic regularity in $W$-hyperbolic spaces and quadratic rates of $T$- and $U$-asymptotic regularity in CAT(0) spaces.
Submitted 27 February, 2023; v1 submitted 5 June, 2022;
originally announced June 2022.
-
OCR Synthetic Benchmark Dataset for Indic Languages
Authors:
Naresh Saini,
Promodh Pinto,
Aravinth Bheemaraj,
Deepak Kumar,
Dhiraj Daga,
Saurabh Yadav,
Srihari Nagaraj
Abstract:
We present the largest publicly available synthetic OCR benchmark dataset for Indic languages. The collection contains a total of 90k images and their ground truth for 23 Indic languages. OCR model validation in Indic languages requires a good amount of diverse data to be processed in order to create a robust and reliable model. Generating such a huge amount of data would be difficult otherwise but with synthetic data, it becomes far easier. It can be of great importance to fields like Computer Vision or Image Processing where once an initial synthetic data is developed, model creation becomes easier. Generating synthetic data comes with the flexibility to adjust its nature and environment as and when required in order to improve the performance of the model. Accuracy for labeled real-time data is sometimes quite expensive while accuracy for synthetic data can be easily achieved with a good score.
Submitted 5 May, 2022;
originally announced May 2022.
-
Weighted Connected Matchings
Authors:
Guilherme C. M. Gomes,
Bruno P. Masquio,
Paulo E. D. Pinto,
Vinicius F. dos Santos,
Jayme L. Szwarcfiter
Abstract:
A matching $M$ is a $\mathscr{P}$-matching if the subgraph induced by the endpoints of the edges of $M$ satisfies property $\mathscr{P}$. As examples, for appropriate choices of $\mathscr{P}$, the problems Induced Matching, Uniquely Restricted Matching, Connected Matching and Disconnected Matching arise. For many of these problems, finding a maximum $\mathscr{P}$-matching is a knowingly NP-Hard problem, with few exceptions, such as connected matchings, which has the same time complexity as usual Maximum Matching problem. The weighted variant of Maximum Matching has been studied for decades, with many applications, including the well-known Assignment problem. Motivated by this fact, in addition to some recent researches in weighted versions of acyclic and induced matchings, we study the Maximum Weight Connected Matching. In this problem, we want to find a matching $M$ such that the endpoint vertices of its edges induce a connected subgraph and the sum of the edge weights of $M$ is maximum. Unlike the unweighted Connected Matching problem, which is in P for general graphs, we show that Maximum Weight Connected Matching is NP-Hard even for bounded diameter bipartite graphs, starlike graphs, planar bipartite, and bounded degree planar graphs, while solvable in linear time for trees and subcubic graphs. When we restrict edge weights to be non negative only, we show that the problem turns to be polynomially solvable for chordal graphs, while it remains NP-Hard for most of the cases when weights can be negative. Our final contributions are on parameterized complexity. On the positive side, we present a single exponential time algorithm when parameterized by treewidth. In terms of kernelization, we show that, even when restricted to binary weights, Weighted Connected Matching does not admit a polynomial kernel when parameterized by vertex cover under standard complexity-theoretical hypotheses.
Submitted 9 February, 2022;
originally announced February 2022.
-
Strong convergence for the alternating Halpern-Mann iteration in CAT(0) spaces
Authors:
Bruno Dinis,
Pedro Pinto
Abstract:
In this paper we consider, in the general context of CAT(0) spaces, an iterative schema which alternates between Halpern and Krasnoselskii-Mann style iterations. We prove, under suitable conditions, the strong convergence of this algorithm, benefiting from ideas from the proof mining program. We give quantitative information in the form of effective rates of asymptotic regularity and of metastability (in the sense of Tao). Motivated by these results we are also able to obtain strongly convergent versions of the forward-backward and the Douglas-Rachford algorithms. Our results generalize recent work by Boţ, Csetnek and Meier, and Cheval and Leuştean.
Submitted 9 March, 2023; v1 submitted 29 December, 2021;
originally announced December 2021.
-
Disconnected Matchings
Authors:
Guilherme C. M. Gomes,
Bruno P. Masquio,
Paulo E. D. Pinto,
Vinicius F. dos Santos,
Jayme L. Szwarcfiter
Abstract:
In 2005, Goddard, Hedetniemi, Hedetniemi and Laskar [Generalized subgraph-restricted matchings in graphs, Discrete Mathematics, 293 (2005) 129 - 138] asked the computational complexity of determining the maximum cardinality of a matching whose vertex set induces a disconnected graph. In this paper we answer this question. In fact, we consider the generalized problem of finding $c$-disconnected matchings; such matchings are ones whose vertex sets induce subgraphs with at least $c$ connected components. We show that, for every fixed $c \geq 2$, this problem is NP-complete even if we restrict the input to bounded diameter bipartite graphs, while can be solved in polynomial time if $c = 1$. For the case when $c$ is part of the input, we show that the problem is NP-complete for chordal graphs, while being solvable in polynomial time for interval graphs. Finally, we explore the parameterized complexity of the problem. We present an FPT algorithm under the treewidth parameterization, and an XP algorithm for graphs with a polynomial number of minimal separators when parameterized by $c$. We complement these results by showing that, unless NP $\subseteq$ coNP/poly, the related Induced Matching problem does not admit a polynomial kernel when parameterized by vertex cover and size of the matching nor when parameterized by vertex deletion distance to clique and size of the matching. As for Connected Matching, we show how to obtain a maximum connected matching in linear time given an arbitrary maximum matching in the input.
Submitted 16 December, 2021;
originally announced December 2021.
-
Substrate disorder promotes cell motility in confluent tissues
Authors:
Diogo E. P. Pinto,
Margarida M. Telo da Gama,
Nuno A. M. Araujo
Abstract:
In vivo and in vitro cells rely on the support of an underlying biocompatible substrate, such as the extracellular matrix or a culture substrate, to spread and proliferate. The mechanical and chemical properties of such structures play a central role in the dynamical and statistical properties of the tissue. At the cell scale, these substrates are highly disordered. Here, we investigate how spatial heterogeneities of the cell-substrate interaction influence the motility of the cells in a model confluent tissue. We use the Self-Propelled Voronoi model and describe the disorder as a spatially dependent preferred geometry of the individual cells. We found that when the characteristic length scale of the preferred geometry is smaller than the cell size, the tissue is less rigid than its homogeneous counterpart, with a consequent increase in cell motility. This result is in sharp contrast to what has been reported for tissues with heterogeneity in the mechanical properties of the individual cells, where the disorder favors rigidity. Using the fraction of rigid cells, we observe a collapse of the motility data for different model parameters and provide evidence that the rigidity transition in the model tissue is accompanied by the emergence of a spanning cluster of rigid cells.
Submitted 2 November, 2021;
originally announced November 2021.
-
The $\texttt{Abacus}$ Cosmological $N$-body Code
Authors:
Lehman H. Garrison,
Daniel J. Eisenstein,
Douglas Ferrer,
Nina A. Maksimova,
Philip A. Pinto
Abstract:
We present $\texttt{Abacus}$, a fast and accurate cosmological $N$-body code based on a new method for calculating the gravitational potential from a static multipole mesh. The method analytically separates the near- and far-field forces, reducing the former to direct $1/r^2$ summation and the latter to a discrete convolution over multipoles. The method achieves 70 million particle updates per second per node of the Summit supercomputer, while maintaining a median fractional force error of $10^{-5}$. We express the simulation time step as an event-driven "pipeline", incorporating asynchronous events such as completion of co-processor work, Input/Output, and network communication. $\texttt{Abacus}$ has been used to produce the largest suite of $N$-body simulations to date, the $\texttt{AbacusSummit}$ suite of 60 trillion particles (Maksimova et al., 2021), incorporating on-the-fly halo finding. $\texttt{Abacus}$ enables the production of mock catalogs of the volume and resolution required by the coming generation of cosmological surveys.
Submitted 21 October, 2021;
originally announced October 2021.
-
Hierarchical structure of the energy landscape in the Voronoi model of dense tissue
Authors:
D. E. P. Pinto,
D. M. Sussman,
M. M. Telo da Gama,
N. A. M. Araujo
Abstract:
The Voronoi model is a popular tool for studying confluent living tissues. It exhibits an anomalous glassy behavior even at very low temperatures or weak active self-propulsion, and at zero temperature the model exhibits a disordered solid structure with no evidence of a rigidity transition. Here we investigate the properties of the energy landscape in this limit. We find two disordered solid phases that have similar structural features but that differ in the ultrametricity of their energy landscapes; the crossover between these two states shares phenomenological properties with a Gardner transition. We further highlight how the metric used to calculate distances between configurations influences the ability to detect hierarchical arrangements of basins in the energy landscape.
Submitted 29 September, 2021;
originally announced September 2021.
-
Evaluating Large Language Models Trained on Code
Authors:
Mark Chen,
Jerry Tworek,
Heewoo Jun,
Qiming Yuan,
Henrique Ponde de Oliveira Pinto,
Jared Kaplan,
Harri Edwards,
Yuri Burda,
Nicholas Joseph,
Greg Brockman,
Alex Ray,
Raul Puri,
Gretchen Krueger,
Michael Petrov,
Heidy Khlaaf,
Girish Sastry,
Pamela Mishkin,
Brooke Chan,
Scott Gray,
Nick Ryder,
Mikhail Pavlov,
Alethea Power,
Lukasz Kaiser,
Mohammad Bavarian,
Clemens Winter
, et al. (33 additional authors not shown)
Abstract:
We introduce Codex, a GPT language model fine-tuned on publicly available code from GitHub, and study its Python code-writing capabilities. A distinct production version of Codex powers GitHub Copilot. On HumanEval, a new evaluation set we release to measure functional correctness for synthesizing programs from docstrings, our model solves 28.8% of the problems, while GPT-3 solves 0% and GPT-J solves 11.4%. Furthermore, we find that repeated sampling from the model is a surprisingly effective strategy for producing working solutions to difficult prompts. Using this method, we solve 70.2% of our problems with 100 samples per problem. Careful investigation of our model reveals its limitations, including difficulty with docstrings describing long chains of operations and with binding operations to variables. Finally, we discuss the potential broader impacts of deploying powerful code generation technologies, covering safety, security, and economics.
Submitted 14 July, 2021; v1 submitted 7 July, 2021;
originally announced July 2021.
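The repeated-sampling result quoted in the abstract above (70.2% of problems solved with 100 samples per problem) can be illustrated with a small sketch. The abstract does not say how the solve rate is estimated, so the combinatorial estimator below is an assumption, as are the hypothetical names `pass_at_k`, `mean_pass_at_k`, `n`, `c`, and `k`. It computes the probability that at least one of `k` samples, drawn without replacement from `n` generated samples of which `c` pass the tests, is correct.

```python
import math


def pass_at_k(n: int, c: int, k: int) -> float:
    """Chance that at least one of k samples is correct.

    n: samples generated for a problem (assumed notation)
    c: how many of those n samples passed the unit tests
    k: how many samples we are allowed to submit
    """
    if not (0 <= c <= n and 1 <= k <= n):
        raise ValueError("require 0 <= c <= n and 1 <= k <= n")
    if n - c < k:
        # Fewer than k incorrect samples exist, so any k-subset contains a correct one.
        return 1.0
    # 1 - P(all k drawn samples are incorrect)
    return 1.0 - math.comb(n - c, k) / math.comb(n, k)


def mean_pass_at_k(results: list[tuple[int, int]], k: int) -> float:
    """Average pass_at_k over problems; results holds (n, c) pairs."""
    return sum(pass_at_k(n, c, k) for n, c in results) / len(results)
```

Under these assumptions, a problem that is solved by only a few of its samples still contributes a high score once `k` is large, which is one way to read the gap between the single-sample and 100-sample figures in the abstract.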
-
Quantitative translations for viscosity approximation methods in hyperbolic spaces
Authors:
Ulrich Kohlenbach,
Pedro Pinto
Abstract:
In the setting of hyperbolic spaces, we show that the convergence of Browder-type sequences and Halpern iterations respectively entail the convergence of their viscosity version with a Rakotch map. We also show that the convergence of a hybrid viscosity version of the Krasnoselskii-Mann iteration follows from the convergence of the Browder-type sequence. Our results follow from proof-theoretic techniques (proof mining). From an analysis of theorems due to T. Suzuki, we extract a transformation of rates for the original Browder-type and Halpern iterations into rates for the corresponding viscosity versions. We show that these transformations can be applied to earlier quantitative studies of these iterations. From an analysis of a theorem due to H.-K. Xu, N. Altwaijry and S. Chebbi, we obtain similar results. Finally, in uniformly convex Banach spaces we study a strong notion of accretive operator due to Brezis and Sibony and extract a uniform modulus of uniqueness for the property of being a zero point. In this context, we show that it is possible to obtain Cauchy rates for the Browder-type and the Halpern iterations (and hence also for their viscosity versions).
Submitted 7 February, 2021;
originally announced February 2021.
-
Effective metastability for a method of alternating resolvents
Authors:
Bruno Dinis,
Pedro Pinto
Abstract:
A generalized method of alternating resolvents was introduced by Boikanyo and Moroşanu as a way to approximate common zeros of two maximal monotone operators. In this paper we analyse the strong convergence of this algorithm under two different sets of conditions. As a consequence we obtain effective rates of metastability (in the sense of Terence Tao) and quasi-rates of asymptotic regularity. Furthermore, we bypass the need for sequential weak compactness in the original proofs. Our quantitative results are obtained using proof-theoretical techniques in the context of the proof mining program.
Submitted 29 January, 2021;
originally announced January 2021.
-
Asymmetric self-play for automatic goal discovery in robotic manipulation
Authors:
OpenAI OpenAI,
Matthias Plappert,
Raul Sampedro,
Tao Xu,
Ilge Akkaya,
Vineet Kosaraju,
Peter Welinder,
Ruben D'Sa,
Arthur Petron,
Henrique P. d. O. Pinto,
Alex Paino,
Hyeonwoo Noh,
Lilian Weng,
Qiming Yuan,
Casey Chu,
Wojciech Zaremba
Abstract:
We train a single, goal-conditioned policy that can solve many robotic manipulation tasks, including tasks with previously unseen goals and objects. We rely on asymmetric self-play for goal discovery, where two agents, Alice and Bob, play a game. Alice is asked to propose challenging goals and Bob aims to solve them. We show that this method can discover highly diverse and complex goals without any human priors. Bob can be trained with only sparse rewards, because the interaction between Alice and Bob results in a natural curriculum and Bob can learn from Alice's trajectory when relabeled as a goal-conditioned demonstration. Finally, our method scales, resulting in a single policy that can generalize to many unseen tasks such as setting a table, stacking blocks, and solving simple puzzles. Videos of a learned policy are available at https://robotics-self-play.github.io.
Submitted 13 January, 2021;
originally announced January 2021.
-
Representations of Higman-Thompson groups from Cuntz algebras
Authors:
Francisco Araújo,
Paulo R. Pinto
Abstract:
Every representation of the Cuntz algebra $\mathcal{O}_n$ leads to a unitary representation of the Higman-Thompson group $V_n$. We consider the family $\{π_x\}_{x\in [0,1[}$ of permutative representations of $\mathcal{O}_n$ that arise from the interval map $f(x)=nx$ (mod 1) acting on the Hilbert space that underlies each orbit, and then study the unitary equivalence and the irreducibility of the corresponding family $\{ρ_x\}_{x\in [0,1[}$ of representations of the Higman-Thompson group $V_n$, showing that these representations are indeed irreducible and moreover $ρ_x$ and $ρ_y$ are equivalent if and only if the orbits of $x$ and $y$ coincide.
Submitted 23 December, 2021; v1 submitted 27 November, 2020;
originally announced November 2020.
-
On the convergence of algorithms with Tikhonov regularization terms
Authors:
Bruno Dinis,
Pedro Pinto
Abstract:
We consider the strongly convergent modified versions of the Krasnosel'skiĭ-Mann, the forward-backward and the Douglas-Rachford algorithms with Tikhonov regularization terms, introduced by Radu Boţ, Ernö Csetnek and Dennis Meier. We obtain quantitative information for these modified iterations, namely rates of asymptotic regularity and metastability. Furthermore, our arguments avoid the use of sequential weak compactness and use only a weak form of the projection argument.
Submitted 14 May, 2020;
originally announced May 2020.
-
The Distributional Stress-Energy Quadrupole
Authors:
Jonathan Gratus,
Paolo Pinto,
Spyridon Talaganis
Abstract:
We investigate stress-energy tensors constructed from the delta function on a worldline. We concentrate on quadrupoles as they make an excellent model for the dominant source of gravitational waves and have significant novel features. Unlike the dipole, we show that the quadrupole has 20 free components which are not determined by the properties of the stress-energy tensor. These need to be derived from an underlying model and we give an example motivated from a divergence-free dust. We show that the components corresponding to the partial-derivative representation of the quadrupole have a gauge-like freedom. We give the change-of-coordinates formula, which involves second derivatives and two integrals. We also show how to define the quadrupole without reference to a coordinate system or a metric. For the representation using covariant derivatives, we show how to split a quadrupole into a pure monopole, pure dipole and pure quadrupole in a coordinate-free way.
Submitted 25 November, 2020; v1 submitted 6 May, 2020;
originally announced May 2020.
-
The cell adaptation time sets a minimum length scale for patterned substrates
Authors:
Diogo E. P. Pinto,
Gonca Erdemci-Tandogan,
M. Lisa Manning,
Nuno A. M. Araujo
Abstract:
The structure and dynamics of tissue cultures depend strongly on the physical and chemical properties of the underlying substrate. Inspired by previous advances in the context of inorganic materials, the use of patterned culture surfaces has been proposed as an effective way to induce space-dependent properties in cell tissues. However, cells move and diffuse and the transduction of external stimuli to biological signals is not instantaneous. Here, we show that the fidelity of patterns depends on the relation between the diffusion ($τ_D$) and adaptation ($τ$) times. Numerical results for the self-propelled Voronoi model reveal that the fidelity decreases with $τ/τ_D$, a result that is reproduced by a continuum reaction-diffusion model. We derive a minimum length scale for the patterns that depends on $τ/τ_D$ and can be much larger than the cell size.
Submitted 1 May, 2020;
originally announced May 2020.
-
Quantitative results on a Halpern-type proximal point algorithm
Authors:
Laurentiu Leustean,
Pedro Pinto
Abstract:
We apply proof mining methods to analyse a result of Boikanyo and Moroşanu on the strong convergence of a Halpern-type proximal point algorithm. As a consequence, we obtain quantitative versions of this result, providing uniform effective rates of asymptotic regularity and metastability.
Submitted 26 February, 2021; v1 submitted 27 January, 2020;
originally announced January 2020.
-
A rate of metastability for the Halpern type Proximal Point Algorithm
Authors:
Pedro Pinto
Abstract:
Using proof-theoretical techniques, we analyze a proof by H.-K. Xu regarding a result of strong convergence for the Halpern type proximal point algorithm. We obtain a rate of metastability (in the sense of T. Tao) and also a rate of asymptotic regularity for the iteration. Furthermore, our final quantitative result bypasses the need of the sequential weak compactness argument present in the original proof. This elimination is reflected in the extraction of primitive recursive quantitative information. This work follows from recent results in Proof Mining regarding the removal of sequential weak compactness arguments.
Submitted 28 December, 2019;
originally announced December 2019.
-
Quantitative results on the multi-parameters Proximal Point Algorithm
Authors:
Bruno Dinis,
Pedro Pinto
Abstract:
We give a quantitative analysis of a theorem due to Fenghui Wang and Huanhuan Cui concerning the convergence of a multi-parametric version of the proximal point algorithm. Wang and Cui's result ensures the convergence of the algorithm to a zero of the operator. Our quantitative analysis provides explicit bounds on the metastability (in the sense of Terence Tao) for the convergence and the asymptotic regularity of the iteration. Moreover, our analysis bypasses the need of sequential weak compactness and only requires a weak form of the metric projection argument.
Submitted 19 December, 2019;
originally announced December 2019.
-
Dota 2 with Large Scale Deep Reinforcement Learning
Authors:
OpenAI,
Christopher Berner,
Greg Brockman,
Brooke Chan,
Vicki Cheung,
Przemysław Dębiak,
Christy Dennison,
David Farhi,
Quirin Fischer,
Shariq Hashme,
Chris Hesse,
Rafal Józefowicz,
Scott Gray,
Catherine Olsson,
Jakub Pachocki,
Michael Petrov,
Henrique P. d. O. Pinto,
Jonathan Raiman,
Tim Salimans,
Jeremy Schlatter,
Jonas Schneider,
Szymon Sidor,
Ilya Sutskever,
Jie Tang
, et al. (2 additional authors not shown)
Abstract:
On April 13th, 2019, OpenAI Five became the first AI system to defeat the world champions at an esports game. The game of Dota 2 presents novel challenges for AI systems such as long time horizons, imperfect information, and complex, continuous state-action spaces, all challenges which will become increasingly central to more capable AI systems. OpenAI Five leveraged existing reinforcement learning techniques, scaled to learn from batches of approximately 2 million frames every 2 seconds. We developed a distributed training system and tools for continual training which allowed us to train OpenAI Five for 10 months. By defeating the Dota 2 world champion (Team OG), OpenAI Five demonstrates that self-play reinforcement learning can achieve superhuman performance on a difficult task.
Submitted 13 December, 2019;
originally announced December 2019.
-
Metastability of the proximal point algorithm with multi-parameters
Authors:
Bruno Dinis,
Pedro Pinto
Abstract:
In this article we use techniques of proof mining to analyse a result, due to Yonghong Yao and Muhammad Aslam Noor, concerning the strong convergence of a generalized proximal point algorithm which involves multiple parameters. Yao and Noor's result ensures the strong convergence of the algorithm to the nearest projection point onto the set of zeros of the operator. Our quantitative analysis, guided by Fernando Ferreira and Paulo Oliva's bounded functional interpretation, provides a primitive recursive bound on the metastability for the convergence of the algorithm, in the sense of Terence Tao. Furthermore, we obtain quantitative information on the asymptotic regularity of the iteration. The results of this paper are made possible by an arithmetization of the $\limsup$.
Submitted 26 March, 2020; v1 submitted 21 June, 2019;
originally announced June 2019.
-
Cosmology with Stacked Cluster Weak Lensing and Cluster-Galaxy Cross-Correlations
Authors:
Andrés N. Salcedo,
Benjamin D. Wibking,
David H. Weinberg,
Hao-Yi Wu,
Douglas Ferrer,
Daniel Eisenstein,
Philip Pinto
Abstract:
Cluster weak lensing is a sensitive probe of cosmology, particularly the amplitude of matter clustering $σ_8$ and matter density parameter $Ω_m$. The main nuisance parameter in a cluster weak lensing cosmological analysis is the scatter between the true halo mass and the relevant cluster observable, denoted $σ_{\ln Mc}$. We show that combining the cluster weak lensing observable $ΔΣ$ with the projected cluster-galaxy cross-correlation function $w_{p,cg}$ and galaxy auto-correlation function $w_{p,gg}$ can break the degeneracy between $σ_8$ and $σ_{\ln Mc}$ to achieve tight, percent-level constraints on $σ_8$. Using a grid of cosmological N-body simulations, we compute derivatives of $ΔΣ$, $w_{p,cg}$, and $w_{p,gg}$ with respect to $σ_8$, $Ω_m$, $σ_{\ln Mc}$ and halo occupation distribution (HOD) parameters describing the galaxy population. We also compute covariance matrices motivated by the properties of the Dark Energy Survey (DES) cluster and weak lensing survey and the BOSS CMASS galaxy redshift survey. For our fiducial scenario combining $ΔΣ$, $w_{p,cg}$, and $w_{p,gg}$ measured over $0.3-30.0 \; h^{-1} \; \mathrm{Mpc}$, for clusters at $z=0.35-0.55$ above a mass threshold $M_c\approx 2\times 10^{14} \; h^{-1} \; \mathrm{M_{\odot}}$, we forecast a $1.4\%$ constraint on $σ_8$ while marginalizing over $σ_{\ln Mc}$ and all HOD parameters. Reducing the mass threshold to $1\times 10^{14} \; h^{-1} \; \mathrm{M_{\odot}}$ and adding a $z=0.15-0.35$ redshift bin sharpens this constraint to $0.8\%$. The small scale $(r_p < 3.0 \; h^{-1} \; \mathrm{Mpc})$ ``mass function'' and large scale $(r_p > 3.0 \; h^{-1} \; \mathrm{Mpc})$ ``halo-mass cross-correlation'' regimes of $ΔΣ$ have comparable constraining power, allowing internal consistency tests from such an analysis.
Submitted 15 June, 2019;
originally announced June 2019.
-
An Efficient Monte Carlo-based Probabilistic Time-Dependent Routing Calculation Targeting a Server-Side Car Navigation System
Authors:
Emanuele Vitali,
Davide Gadioli,
Gianluca Palermo,
Martin Golasowski,
Joao Bispo,
Pedro Pinto,
Jan Martinovic,
Katerina Slaninova,
Joao M. P. Cardoso,
Cristina Silvano
Abstract:
Incorporating speed probability distribution to the computation of the route planning in car navigation systems guarantees more accurate and precise responses. In this paper, we propose a novel approach for dynamically selecting the number of samples used for the Monte Carlo simulation to solve the Probabilistic Time-Dependent Routing (PTDR) problem, thus improving the computation efficiency. The proposed method is used to determine in a proactive manner the number of simulations to be done to extract the travel-time estimation for each specific request while respecting an error threshold as output quality level. The methodology requires a reduced effort on the application development side. We adopted an aspect-oriented programming language (LARA) together with a flexible dynamic autotuning library (mARGOt) respectively to instrument the code and to take tuning decisions on the number of samples improving the execution efficiency. Experimental results demonstrate that the proposed adaptive approach saves a large fraction of simulations (between 36% and 81%) with respect to a static approach while considering different traffic situations, paths and error requirements. Given the negligible runtime overhead of the proposed approach, it results in an execution-time speedup between 1.5x and 5.1x. This speedup is reflected at infrastructure-level in terms of a reduction of around 36% of the computing resources needed to support the whole navigation pipeline.
Submitted 18 January, 2019;
originally announced January 2019.
-
The ANTAREX Domain Specific Language for High Performance Computing
Authors:
Cristina Silvano,
Giovanni Agosta,
Andrea Bartolini,
Andrea R. Beccari,
Luca Benini,
Loïc Besnard,
João Bispo,
Radim Cmar,
João M. P. Cardoso,
Carlo Cavazzoni,
Daniele Cesarini,
Stefano Cherubin,
Federico Ficarelli,
Davide Gadioli,
Martin Golasowski,
Antonio Libri,
Jan Martinovič,
Gianluca Palermo,
Pedro Pinto,
Erven Rohou,
Kateřina Slaninová,
Emanuele Vitali
Abstract:
The ANTAREX project relies on a Domain Specific Language (DSL) based on Aspect Oriented Programming (AOP) concepts to allow applications to enforce extra functional properties such as energy-efficiency and performance and to optimize Quality of Service (QoS) in an adaptive way. The DSL approach allows the definition of energy-efficiency, performance, and adaptivity strategies as well as their enforcement at runtime through application autotuning and resource and power management. In this paper, we present an overview of the key outcome of the project, the ANTAREX DSL, and some of its capabilities through a number of examples, including how the DSL is applied in the context of the project use cases.
Submitted 18 January, 2019;
originally announced January 2019.
-
Parallel Clustering of Single Cell Transcriptomic Data with Split-Merge Sampling on Dirichlet Process Mixtures
Authors:
Tiehang Duan,
José P. Pinto,
Xiaohui Xie
Abstract:
Motivation: With the development of droplet based systems, massive single cell transcriptome data has become available, which enables analysis of cellular and molecular processes at single cell resolution and is instrumental to understanding many biological processes. While state-of-the-art clustering methods have been applied to the data, they face challenges in the following aspects: (1) the clustering quality still needs to be improved; (2) most models need prior knowledge on number of clusters, which is not always available; (3) there is a demand for faster computational speed. Results: We propose to tackle these challenges with Parallel Split Merge Sampling on Dirichlet Process Mixture Model (the Para-DPMM model). Unlike classic DPMM methods that perform sampling on each single data point, the split merge mechanism samples on the cluster level, which significantly improves convergence and optimality of the result. The model is highly parallelized and can utilize the computing power of high performance computing (HPC) clusters, enabling massive clustering on huge datasets. Experiment results show the model outperforms current widely used models in both clustering quality and computational speed. Availability: Source code is publicly available on https://github.com/tiehangd/Para_DPMM/tree/master/Para_DPMM_package
Submitted 25 December, 2018;
originally announced December 2018.
-
Hubble drift in Palatini $f(\mathcal{R})$-theories
Authors:
L. Del Vecchio,
L. Fatibene,
S. Capozziello,
M. Ferraris,
P. Pinto,
S. Camera
Abstract:
In a Palatini $f(\mathcal{R})$-model, we define chronodynamical effects due to the choice of atomic clocks as standard reference clocks and we develop a formalism able to quantitatively separate them from the usual effective dark sources one has in extended theories. We apply the formalism to Hubble drift and briefly discuss the issue about the physical frame. In particular, we argue that there is no physical frame in the sense one does different things in different frames and that, in a sense, is the physical characteristic of extended gravity. As an example, we discuss how Jordan frame may be well suited to discuss cosmology, though it fails within the solar system.
Submitted 1 November, 2018; v1 submitted 25 October, 2018;
originally announced October 2018.
-
A High-Fidelity Realization of the Euclid Code Comparison $N$-body Simulation with Abacus
Authors:
Lehman H. Garrison,
Daniel J. Eisenstein,
Philip A. Pinto
Abstract:
We present a high-fidelity realization of the cosmological $N$-body simulation from the Schneider et al. (2016) code comparison project. The simulation was performed with our Abacus $N$-body code, which offers high force accuracy, high performance, and minimal particle integration errors. The simulation consists of $2048^3$ particles in a $500\ h^{-1}\mathrm{Mpc}$ box, for a particle mass of $1.2\times 10^9\ h^{-1}\mathrm{M}_\odot$ with $10\ h^{-1}\mathrm{kpc}$ spline softening. Abacus executed 1052 global time steps to $z=0$ in 107 hours on one dual-Xeon, dual-GPU node, for a mean rate of 23 million particles per second per step. We find Abacus is in good agreement with Ramses and Pkdgrav3 and less so with Gadget3. We validate our choice of time step by halving the step size and find sub-percent differences in the power spectrum and 2PCF at nearly all measured scales, with $<0.3\%$ errors at $k<10\ \mathrm{Mpc}^{-1}h$. On large scales, Abacus reproduces linear theory better than $0.01\%$. Simulation snapshots are available at http://nbody.rc.fas.harvard.edu/public/S2016 .
Submitted 6 March, 2019; v1 submitted 5 October, 2018;
originally announced October 2018.
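The throughput quoted in the abstract above can be checked with a few lines of arithmetic. This is only a sanity check on the stated figures ($2048^3$ particles, 1052 global time steps, 107 hours); the variable names are illustrative and are not taken from the Abacus code.

```python
# Sanity check of the mean update rate quoted in the abstract.
# All inputs are the figures stated there; names are illustrative only.
n_particles = 2048 ** 3          # particles in the simulation
n_steps = 1052                   # global time steps to z = 0
wall_seconds = 107 * 3600        # 107 hours of wall-clock time

# Mean number of particle updates per second, averaged over the run.
rate = n_particles * n_steps / wall_seconds

print(f"{rate / 1e6:.1f} million particle updates per second")
```

The result is consistent with the "23 million particles per second per step" stated in the abstract.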
-
On the removal of weak compactness arguments in proof mining
Authors:
Fernando Ferreira,
Laurentiu Leustean,
Pedro Pinto
Abstract:
The main observation of this paper is that some sequential weak compactness arguments in Hilbert space theory can be replaced by Heine/Borel compactness arguments (for the strong topology). Even though the latter form of compactness fails in (infinite-dimensional) Hilbert spaces, it nevertheless trivializes under the so-called bounded functional interpretation. As a consequence, the proof mining programme of extracting computational bounds from ordinary proofs of mathematics can be applied to {\em modified proofs} which use these false Heine/Borel compactness arguments. Additionally, the bounded functional interpretation provides good logical guidance in formulating quantitative versions of analytical statements. We illustrate these claims with three minings. The bounded functional interpretation is here used for the first time in proof mining.
Submitted 25 July, 2019; v1 submitted 2 October, 2018;
originally announced October 2018.
-
On graph algebras from interval maps
Authors:
Carlos Correia Ramos,
Nuno Martins,
Paulo R. Pinto
Abstract:
We produce and study a family of representations of relative graph algebras on Hilbert spaces that arise from the orbits of points of one dimensional dynamical systems, where the underlying Markov interval maps $f$ have escape sets. We identify when such representations are faithful in terms of the transitions to the escape subintervals.
Submitted 19 July, 2018;
originally announced July 2018.
-
Extended Cosmology in Palatini f(R)-theories
Authors:
Paolo Pinto,
Leonardo Del Vecchio,
Lorenzo Fatibene,
Marco Ferraris
Abstract:
We consider the cosmological models based on Palatini f(R)-theory for the function f(R)=aR-2bR^2-3c/R, which, when only dust visible matter is considered, is called dune cosmology in view of the shape of the function f(R(a)) (a being the scale factor). We discuss the meaning of solving the model, and interpret it according to the Ehlers-Pirani-Schild framework as defining a Weyl geometry on spacetime. Accordingly, we extend the definitions of luminosity distance, proper distance, and redshift to Weyl geometries and fit the values of parameters to SNIa data. Since the theoretical prediction is model-dependent, we argue that it is affected by an extra choice, namely a model for atomic clocks, which, in principle, produces observable effects. To the best of our knowledge, these effects have not been considered in the literature before.
Submitted 26 February, 2020; v1 submitted 1 July, 2018;
originally announced July 2018.
-
Learning Deep Similarity Metric for 3D MR-TRUS Registration
Authors:
Grant Haskins,
Jochen Kruecker,
Uwe Kruger,
Sheng Xu,
Peter A. Pinto,
Brad J. Wood,
Pingkun Yan
Abstract:
Purpose: The fusion of transrectal ultrasound (TRUS) and magnetic resonance (MR) images for guiding targeted prostate biopsy has significantly improved the biopsy yield of aggressive cancers. A key component of MR-TRUS fusion is image registration. However, it is very challenging to obtain a robust automatic MR-TRUS registration due to the large appearance difference between the two imaging modalities. The work presented in this paper aims to tackle this problem by addressing two challenges: (i) the definition of a suitable similarity metric and (ii) the determination of a suitable optimization strategy.
Methods: This work proposes the use of a deep convolutional neural network to learn a similarity metric for MR-TRUS registration. We also use a composite optimization strategy that explores the solution space in order to search for a suitable initialization for the second-order optimization of the learned metric. Further, a multi-pass approach is used in order to smooth the metric for optimization.
Results: The learned similarity metric outperforms classical mutual information and also the state-of-the-art MIND feature-based methods. The results indicate that the overall registration framework has a large capture range. The proposed deep similarity metric-based approach obtained a mean TRE of 3.86 mm (with an initial TRE of 16 mm) for this challenging problem.
Conclusion: A similarity metric that is learned using a deep neural network can be used to assess the quality of any given image registration and can be used in conjunction with the aforementioned optimization framework to perform automatic registration that is robust to poor initialization.
Submitted 15 October, 2018; v1 submitted 12 June, 2018;
originally announced June 2018.
-
Discrete Relativistic Positioning Systems
Authors:
Sante Carloni,
Lorenzo Fatibene,
Marco Ferraris,
Raymond G. McLenaghan,
Paolo Pinto
Abstract:
We discuss the design for a discrete, immediate, simple relativistic positioning system (rPS) which is potentially capable of self-positioning (up to isometries) and of operating without calibration or ground control assistance. The design is discussed in dimension two on spacetime (i.e. one spatial dimension plus one time dimension), in Minkowski and Schwarzschild solutions, as well as in dimension three (i.e. two spatial dimensions plus one time dimension) in Minkowski. The system works without calibration, clock synchronizations, or a priori knowledge about the motion of clocks. It is able to self-diagnose a breakdown of its hypotheses (for example, if one clock temporarily ceases to be freely falling, or the gravitational field changes) and it is automatically back and operational when the assumed conditions are restored. In the Schwarzschild case, we show that the system can also best fit the gravitational mass of the source of the gravitational field and stress that no weak field assumptions are made anywhere. In particular, the rPS we propose can work in a region close to the horizon since it does not use approximations or PPN expansions. More generally, rPSs can be adapted as detectors for the gravitational field and we shall briefly discuss their role in testing different theoretical settings for gravity. In fact, an rPS is a natural candidate for a canonical method to extract observables out of a gravitational theory, an activity also known as designing experiments to test gravity.
Submitted 26 February, 2020; v1 submitted 12 May, 2018;
originally announced May 2018.
-
Random Sequential Adsorption on mobile patches
Authors:
Diogo E. P. Pinto,
Nuno A. M. Araujo
Abstract:
An extension of the Random Sequential Adsorption (RSA) model has been proposed recently, motivated by the coverage of oil droplets by DNA-functionalized colloidal particles. Particles arrive at a flat substrate with a uniform flux F but they can only adsorb on patches. Patches diffuse on the substrate with a diffusion coefficient D if they are free and they remain immobile when attached to an adsorbed particle. The adsorption is considered irreversible and particles cannot adsorb on top of each other. Thus, the system reaches a jammed state, consisting of a monolayer where no more particles can adsorb. We performed Monte Carlo simulations to study the adsorption kinetics and jammed-state morphology on a one-dimensional lattice. We show that, while the time-dependence of the coverage depends on F and D, the jammed-state coverage depends solely on the ratio F/D. This result is captured by a simple mean-field calculation. We also report two different regimes for the functional dependence of the jammed-state coverage on the size of the particles, for low and high density of patches.
Submitted 18 April, 2018;
originally announced April 2018.
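The model described in the abstract above can be sketched as a small event-driven Monte Carlo loop. This is a toy illustration under stated assumptions, not the authors' code: the lattice is periodic, a particle covers `2 * half_width + 1` sites centred on its patch, free patches hop to a neighbouring site only if no other patch sits there, and the run stops after a fixed number of events rather than detecting the jammed state. The function name `simulate_rsa` and all parameter names are hypothetical.

```python
import random

def simulate_rsa(n_sites=300, n_patches=60, half_width=1,
                 flux=1.0, diff=1.0, n_events=50_000, seed=0):
    """Toy 1D RSA on mobile patches; returns the fraction of covered sites."""
    rng = random.Random(seed)
    patch_sites = set(rng.sample(range(n_sites), n_patches))  # all patches
    free = sorted(patch_sites)      # sites of patches that can still diffuse
    covered = [False] * n_sites     # sites shadowed by an adsorbed particle

    for _ in range(n_events):
        if not free:
            break                   # every patch already holds a particle
        rate_arrive = flux * n_sites        # assumed total arrival rate
        rate_hop = diff * len(free)         # assumed total hopping rate
        if rng.random() * (rate_arrive + rate_hop) < rate_arrive:
            # A particle lands on a uniformly chosen site.
            site = rng.randrange(n_sites)
            if site in free:
                span = [(site + d) % n_sites
                        for d in range(-half_width, half_width + 1)]
                # Irreversible adsorption, with no overlap allowed.
                if not any(covered[s] for s in span):
                    for s in span:
                        covered[s] = True
                    free.remove(site)       # the patch is now immobile
        else:
            # A free patch attempts a hop to a neighbouring site.
            i = rng.randrange(len(free))
            target = (free[i] + rng.choice((-1, 1))) % n_sites
            if target not in patch_sites:
                patch_sites.remove(free[i])
                patch_sites.add(target)
                free[i] = target

    return sum(covered) / n_sites

coverage = simulate_rsa(flux=1.0, diff=2.0, seed=1)
print(f"coverage after the run: {coverage:.3f}")
```

Because physical time is not tracked, the sequence of events in this sketch depends on `flux` and `diff` only through their ratio, which mirrors the abstract's statement about the jammed-state coverage. Recovering the time dependence would require drawing waiting times from the total rate.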
-
The structure of doubly non-commuting isometries
Authors:
Marcel de Jeu,
Paulo R. Pinto
Abstract:
Suppose that $n\geq 1$ and that, for all $i$ and $j$ with $1\leq i,j\leq n$ and $i\neq j$, $z_{ij}\in{\mathbb T}$ are given such that $z_{ji}=\overline{z}_{ij}$ for all $i\neq j$. If $V_1,\dotsc, V_n$ are isometries on a Hilbert space such that $V_i^\ast V_j^{\phantom{\ast}}\!=\overline{z}_{ij} V_j^{\phantom{\ast}}\!V_i^\ast$ for all $i\neq j$, then $(V_1,\dotsc,V_n)$ is called an $n$-tuple of doubly non-commuting isometries. The generators of non-commutative tori are well-known examples. In this paper, we establish a simultaneous Wold decomposition for $(V_1,\dotsc,V_n)$. This decomposition enables us to classify such $n$-tuples up to unitary equivalence. We show that the joint listing of a unitary equivalence class of a representation of each of the $2^n$ non-commutative tori that are naturally associated with the structure constants is a classifying invariant. A dilation theorem is also established, showing that an $n$-tuple of doubly non-commuting isometries can be extended to an $n$-tuple of doubly non-commuting unitary operators on an enveloping Hilbert space.
Submitted 31 March, 2020; v1 submitted 29 January, 2018;
originally announced January 2018.
-
The Abacus Cosmos: A Suite of Cosmological N-body Simulations
Authors:
Lehman H. Garrison,
Daniel J. Eisenstein,
Douglas Ferrer,
Jeremy L. Tinker,
Philip A. Pinto,
David H. Weinberg
Abstract:
We present a public data release of halo catalogs from a suite of 125 cosmological $N$-body simulations from the Abacus project. The simulations span 40 $w$CDM cosmologies centered on the Planck 2015 cosmology at two mass resolutions, $4\times 10^{10}\;h^{-1}M_\odot$ and $1\times 10^{10}\;h^{-1}M_\odot$, in $1.1\;h^{-1}\mathrm{Gpc}$ and $720\;h^{-1}\mathrm{Mpc}$ boxes, respectively. The boxes are phase-matched to suppress sample variance and isolate cosmology dependence. Additional volume is available via 16 boxes of fixed cosmology and varied phase; a few boxes of single-parameter excursions from Planck 2015 are also provided. Catalogs spanning $z=1.5$ to $0.1$ are available for friends-of-friends and Rockstar halo finders and include particle subsamples. All data products are available at https://lgarrison.github.io/AbacusCosmos
Submitted 23 April, 2018; v1 submitted 15 December, 2017;
originally announced December 2017.
-
The Hilbert series of $\operatorname{SL}_2$-invariants
Authors:
Pedro de Carvalho Cayres Pinto,
Hans-Christian Herbig,
Daniel Herden,
Christopher Seaton
Abstract:
Let $V$ be a finite-dimensional representation of the group $\operatorname{SL}_2$ of $2\times 2$ matrices with complex coefficients and determinant one. Let $R=\mathbb{C}[V]^{\operatorname{SL}_2}$ be the algebra of $\operatorname{SL}_2$-invariant polynomials on $V$. We present a calculation of the Hilbert series $\operatorname{Hilb}_R(t)=\sum_{n\ge 0}\dim (R_n)\: t^n$ as well as formulas for the first four coefficients of the Laurent expansion of $\operatorname{Hilb}_R(t)$ at $t=1$.
Submitted 19 June, 2018; v1 submitted 6 October, 2017;
originally announced October 2017.
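As a small illustration of the objects named in the abstract above (a standard textbook case, not a result taken from the paper), take $V=\mathbb{C}^2\oplus\mathbb{C}^2$, two copies of the defining representation, graded by total degree. The invariant ring is then generated by the single degree-2 determinant $\delta$ of the two vectors, so the Hilbert series and its Laurent expansion at $t=1$ can be written down directly:

```latex
% Assumed example: V = C^2 (+) C^2, so R = C[\delta] with \deg\delta = 2.
\[
  \operatorname{Hilb}_R(t) \;=\; \sum_{n\ge 0} \dim(R_n)\, t^n
  \;=\; \frac{1}{1-t^2}
  \;=\; \frac{1}{(1-t)(1+t)} .
\]
% Expanding 1/(1+t) = 1/(2-(1-t)) in powers of (1-t):
\[
  \operatorname{Hilb}_R(t)
  \;=\; \frac{1}{2}\,(1-t)^{-1} \;+\; \frac{1}{4}
  \;+\; \frac{1}{8}\,(1-t) \;+\; \frac{1}{16}\,(1-t)^2 \;+\; \dotsb
\]
```

The pole of order one at $t=1$ reflects the Krull dimension of $R$, and the first four Laurent coefficients in this example are $1/2$, $1/4$, $1/8$ and $1/16$.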