-
The S-PLUS Fornax Project (S+FP): A first 12-band glimpse of the Fornax galaxy cluster
Authors:
A. V. Smith Castelli,
A. Cortesi,
R. F. Haack,
A. R. Lopes,
J. Thainá-Batista,
R. Cid Fernandes,
L. Lomelí-Núñez,
U. Ribeiro,
C. R. de Bom,
V. Cernic,
L. Sodré Jr,
L. Zenocratti,
M. E. De Rossi,
J. P. Calderón,
F. Herpich,
E. Telles,
K. Saha,
P. A. A. Lopes,
V. H. Lopes-Silva,
T. S. Gonçalves,
D. Bambrila,
N. M. Cardoso,
M. L. Buzzo,
P. Astudillo Sotomayor,
R. Demarco
, et al. (18 additional authors not shown)
Abstract:
The Fornax galaxy cluster is the richest nearby (D ~ 20 Mpc) galaxy association in the southern sky. As such, it provides a wealth of oportunities to elucidate on the processes where environment holds a key role in transforming galaxies. Although it has been the focus of many studies, Fornax has never been explored with contiguous homogeneous wide-field imaging in 12 photometric narrow- and broad-…
▽ More
The Fornax galaxy cluster is the richest nearby (D ~ 20 Mpc) galaxy association in the southern sky. As such, it provides a wealth of oportunities to elucidate on the processes where environment holds a key role in transforming galaxies. Although it has been the focus of many studies, Fornax has never been explored with contiguous homogeneous wide-field imaging in 12 photometric narrow- and broad-bands like those provided by the Southern Photometric Local Universe Survey (S-PLUS). In this paper we present the S-PLUS Fornax Project (S+FP) that aims to comprehensively analyse the galaxy content of the Fornax cluster using S-PLUS. Our data set consists of 106 S-PLUS wide-field frames (FoV ~ 1.4 x 1.4 deg$^2$) observed in five SDSS-like ugriz broad-bands and seven narrow-bands covering specific spectroscopic features like [OII], CaII H+K, H$δ$, G-band, Mg b triplet, H$α$, and the CaII triplet. Based on S-PLUS specific automated photometry, aimed at correctly detecting Fornax galaxies and globular clusters in S-PLUS images, our dataset provides the community with catalogues containing homogeneous 12-band photometry for ~ 3 x 10$^6$ resolved and unresolved objects within a region extending over ~ 208 deg$^2$ (~ 5 Rvir in RA) around Fornax' central galaxy, NGC 1399. We further explore the EAGLE and IllustrisTNG cosmological simulations to identify 45 Fornax-like clusters and generate mock images on all 12 S-PLUS bands of these structures down to galaxies with M$\star \geq 10^8$ M$\odot$. The S+FP dataset we put forward in this first paper of a series will enable a variety of studies some of which are briefly presented.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
A lanthanide-rich kilonova in the aftermath of a long gamma-ray burst
Authors:
Yu-Han Yang,
Eleonora Troja,
Brendan O'Connor,
Chris L. Fryer,
Myungshin Im,
Joe Durbak,
Gregory S. H. Paek,
Roberto Ricci,
Clécio R. De Bom,
James H. Gillanders,
Alberto J. Castro-Tirado,
Zong-Kai Peng,
Simone Dichiara,
Geoffrey Ryan,
Hendrik van Eerten,
Zi-Gao Dai,
Seo-Won Chang,
Hyeonho Choi,
Kishalay De,
Youdong Hu,
Charles D. Kilpatrick,
Alexander Kutyrev,
Mankeun Jeong,
Chung-Uk Lee,
Martin Makler
, et al. (2 additional authors not shown)
Abstract:
Kilonovae are a rare class of astrophysical transients powered by the radioactive decay of nuclei heavier than iron, synthesized in the merger of two compact objects. Over the first few days, the kilonova evolution is dominated by a large number of radioactive isotopes contributing to the heating rate. On timescales of weeks to months, its behavior is predicted to differ depending on the ejecta co…
▽ More
Kilonovae are a rare class of astrophysical transients powered by the radioactive decay of nuclei heavier than iron, synthesized in the merger of two compact objects. Over the first few days, the kilonova evolution is dominated by a large number of radioactive isotopes contributing to the heating rate. On timescales of weeks to months, its behavior is predicted to differ depending on the ejecta composition and merger remnant. However, late-time observations of known kilonovae are either missing or limited. Here we report observations of a luminous red transient with a quasi-thermal spectrum, following an unusual gamma-ray burst of long duration. We classify this thermal emission as a kilonova and track its evolution up to two months after the burst. At these late times, the recession of the photospheric radius and the rapidly-decaying bolometric luminosity ($L_{\rm bol}\propto t^{-2.7\pm 0.4}$) support the recombination of lanthanide-rich ejecta as they cool.
△ Less
Submitted 2 August, 2023; v1 submitted 1 August, 2023;
originally announced August 2023.
-
Changing Data Sources in the Age of Machine Learning for Official Statistics
Authors:
Cedric De Boom,
Michael Reusens
Abstract:
Data science has become increasingly essential for the production of official statistics, as it enables the automated collection, processing, and analysis of large amounts of data. With such data science practices in place, it enables more timely, more insightful and more flexible reporting. However, the quality and integrity of data-science-driven statistics rely on the accuracy and reliability o…
▽ More
Data science has become increasingly essential for the production of official statistics, as it enables the automated collection, processing, and analysis of large amounts of data. With such data science practices in place, it enables more timely, more insightful and more flexible reporting. However, the quality and integrity of data-science-driven statistics rely on the accuracy and reliability of the data sources and the machine learning techniques that support them. In particular, changes in data sources are inevitable to occur and pose significant risks that are crucial to address in the context of machine learning for official statistics.
This paper gives an overview of the main risks, liabilities, and uncertainties associated with changing data sources in the context of machine learning for official statistics. We provide a checklist of the most prevalent origins and causes of changing data sources; not only on a technical level but also regarding ownership, ethics, regulation, and public perception. Next, we highlight the repercussions of changing data sources on statistical reporting. These include technical effects such as concept drift, bias, availability, validity, accuracy and completeness, but also the neutrality and potential discontinuation of the statistical offering. We offer a few important precautionary measures, such as enhancing robustness in both data sourcing and statistical techniques, and thorough monitoring. In doing so, machine learning-based official statistics can maintain integrity, reliability, consistency, and relevance in policy-making, decision-making, and public discourse.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Neural Bayesian Network Understudy
Authors:
Paloma Rabaey,
Cedric De Boom,
Thomas Demeester
Abstract:
Bayesian Networks may be appealing for clinical decision-making due to their inclusion of causal knowledge, but their practical adoption remains limited as a result of their inability to deal with unstructured data. While neural networks do not have this limitation, they are not interpretable and are inherently unable to deal with causal structure in the input space. Our goal is to build neural ne…
▽ More
Bayesian Networks may be appealing for clinical decision-making due to their inclusion of causal knowledge, but their practical adoption remains limited as a result of their inability to deal with unstructured data. While neural networks do not have this limitation, they are not interpretable and are inherently unable to deal with causal structure in the input space. Our goal is to build neural networks that combine the advantages of both approaches. Motivated by the perspective to inject causal knowledge while training such neural networks, this work presents initial steps in that direction. We demonstrate how a neural network can be trained to output conditional probabilities, providing approximately the same functionality as a Bayesian Network. Additionally, we propose two training strategies that allow encoding the independence relations inferred from a given causal structure into the neural network. We present initial results in a proof-of-concept setting, showing that the neural model acts as an understudy to its Bayesian Network counterpart, approximating its probabilistic and causal properties.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Audio-guided Album Cover Art Generation with Genetic Algorithms
Authors:
James Marien,
Sam Leroux,
Bart Dhoedt,
Cedric De Boom
Abstract:
Over 60,000 songs are released on Spotify every day, and the competition for the listener's attention is immense. In that regard, the importance of captivating and inviting cover art cannot be underestimated, because it is deeply entangled with a song's character and the artist's identity, and remains one of the most important gateways to lead people to discover music. However, designing cover art…
▽ More
Over 60,000 songs are released on Spotify every day, and the competition for the listener's attention is immense. In that regard, the importance of captivating and inviting cover art cannot be underestimated, because it is deeply entangled with a song's character and the artist's identity, and remains one of the most important gateways to lead people to discover music. However, designing cover art is a highly creative, lengthy and sometimes expensive process that can be daunting, especially for non-professional artists. For this reason, we propose a novel deep-learning framework to generate cover art guided by audio features. Inspired by VQGAN-CLIP, our approach is highly flexible because individual components can easily be replaced without the need for any retraining. This paper outlines the architectural details of our models and discusses the optimization challenges that emerge from them. More specifically, we will exploit genetic algorithms to overcome bad local minima and adversarial examples. We find that our framework can generate suitable cover art for most genres, and that the visual features adapt themselves to audio feature changes. Given these results, we believe that our framework paves the road for extensions and more advanced applications in audio-guided visual generation tasks.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
SOAR/Goodman Spectroscopic Assessment of Candidate Counterparts of the LIGO-Virgo Event GW190814
Authors:
Douglas Tucker,
Matthew Wiesner,
Sahar Allam,
Marcelle Soares-Santos,
Clecio de Bom,
Melissa Butner,
Alyssa Garcia,
Robert Morgan,
Felipe Olivares,
Antonella Palmese,
Luidhy Santana-Silva,
Anushka Shrivastava,
James Annis,
Juan Garcia-Bellido,
Mandeep Gill,
Kenneth Herner,
Charles Kilpatrick,
Martin Makler,
Nora Sherman,
Adam Amara,
Huan Lin,
Mathew Smith,
Elizabeth Swann,
Iair Arcavi,
Tristan Bachmann
, et al. (118 additional authors not shown)
Abstract:
On 2019 August 14 at 21:10:39 UTC, the LIGO/Virgo Collaboration (LVC) detected a possible neutron star-black hole merger (NSBH), the first ever identified. An extensive search for an optical counterpart of this event, designated GW190814, was undertaken using the Dark Energy Camera (DECam) on the 4m Victor M. Blanco Telescope at the Cerro Tololo Inter-American Observatory. Target of Opportunity in…
▽ More
On 2019 August 14 at 21:10:39 UTC, the LIGO/Virgo Collaboration (LVC) detected a possible neutron star-black hole merger (NSBH), the first ever identified. An extensive search for an optical counterpart of this event, designated GW190814, was undertaken using the Dark Energy Camera (DECam) on the 4m Victor M. Blanco Telescope at the Cerro Tololo Inter-American Observatory. Target of Opportunity interrupts were issued on 8 separate nights to observe 11 candidates using the 4.1m Southern Astrophysical Research (SOAR) telescope's Goodman High Throughput Spectrograph in order to assess whether any of these transients was likely to be an optical counterpart of the possible NSBH merger. Here, we describe the process of observing with SOAR, the analysis of our spectra, our spectroscopic ty** methodology, and our resultant conclusion that none of the candidates corresponded to the gravitational wave merger event but were all instead other transients. Finally, we describe the lessons learned from this effort. Application of these lessons will be critical for a successful community spectroscopic follow-up program for LVC observing run 4 (O4) and beyond.
△ Less
Submitted 2 June, 2022; v1 submitted 27 September, 2021;
originally announced September 2021.
-
A learning gap between neuroscience and reinforcement learning
Authors:
Samuel T. Wauthier,
Pietro Mazzaglia,
Ozan Çatal,
Cedric De Boom,
Tim Verbelen,
Bart Dhoedt
Abstract:
Historically, artificial intelligence has drawn much inspiration from neuroscience to fuel advances in the field. However, current progress in reinforcement learning is largely focused on benchmark problems that fail to capture many of the aspects that are of interest in neuroscience today. We illustrate this point by extending a T-maze task from neuroscience for use with reinforcement learning al…
▽ More
Historically, artificial intelligence has drawn much inspiration from neuroscience to fuel advances in the field. However, current progress in reinforcement learning is largely focused on benchmark problems that fail to capture many of the aspects that are of interest in neuroscience today. We illustrate this point by extending a T-maze task from neuroscience for use with reinforcement learning algorithms, and show that state-of-the-art algorithms are not capable of solving this problem. Finally, we point out where insights from neuroscience could help explain some of the issues encountered.
△ Less
Submitted 4 May, 2021; v1 submitted 22 April, 2021;
originally announced April 2021.
-
The Fornax Cluster through S-PLUS
Authors:
A. V. Smith Castelli,
C. Mendes de Oliveira,
F. Herpich,
C. E. Barbosa,
C. Escudero,
M. Grossi,
L. Sodre,
C. R. de Bom,
L. Zenocratti,
M. E. De Rossi,
A. Cortesi,
R. Cid Fernandes,
A. R. Lopes,
E. Telles,
G. B. Oliveira Schwarz,
M. L. L. Dantas,
F. R. Faifer,
A. Chies Santos,
J. Saponara,
V. Reynaldi,
I. Andruchow,
L. Sesto,
M. F. Mestre,
A. L. de Amorim,
E. V. R. de Lima
, et al. (3 additional authors not shown)
Abstract:
The Southern Photometric Local Universe Survey (S-PLUS) aims to map $\approx$ 9300 deg$^2$ of the Southern sky using the Javalambre filter system of 12 optical bands, 5 Sloan-like filters and 7 narrow-band filters centered on several prominent stellar features ([OII], Ca H+K, D4000, H$_δ$, Mgb, H$_α$ and CaT). S-PLUS is carried out with the T80-South, a new robotic 0.826-m telescope located on CTI…
▽ More
The Southern Photometric Local Universe Survey (S-PLUS) aims to map $\approx$ 9300 deg$^2$ of the Southern sky using the Javalambre filter system of 12 optical bands, 5 Sloan-like filters and 7 narrow-band filters centered on several prominent stellar features ([OII], Ca H+K, D4000, H$_δ$, Mgb, H$_α$ and CaT). S-PLUS is carried out with the T80-South, a new robotic 0.826-m telescope located on CTIO, equipped with a wide FoV camera (2 deg$^2$). In this poster we introduce project #59 of the S-PLUS collaboration aimed at studying the Fornax galaxy cluster covering an sky area of $\approx$ 11 $\times$ 7 deg$^2$, and with homogeneous photometry in the 12 optical bands of S-PLUS (Coordinator: A. Smith Castelli).
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Dynamic Narrowing of VAE Bottlenecks Using GECO and L0 Regularization
Authors:
Cedric De Boom,
Samuel Wauthier,
Tim Verbelen,
Bart Dhoedt
Abstract:
When designing variational autoencoders (VAEs) or other types of latent space models, the dimensionality of the latent space is typically defined upfront. In this process, it is possible that the number of dimensions is under- or overprovisioned for the application at hand. In case the dimensionality is not predefined, this parameter is usually determined using time- and resource-consuming cross-v…
▽ More
When designing variational autoencoders (VAEs) or other types of latent space models, the dimensionality of the latent space is typically defined upfront. In this process, it is possible that the number of dimensions is under- or overprovisioned for the application at hand. In case the dimensionality is not predefined, this parameter is usually determined using time- and resource-consuming cross-validation. For these reasons we have developed a technique to shrink the latent space dimensionality of VAEs automatically and on-the-fly during training using Generalized ELBO with Constrained Optimization (GECO) and the $L_0$-Augment-REINFORCE-Merge ($L_0$-ARM) gradient estimator. The GECO optimizer ensures that we are not violating a predefined upper bound on the reconstruction error. This paper presents the algorithmic details of our method along with experimental results on five different datasets. We find that our training procedure is stable and that the latent space can be pruned effectively without violating the GECO constraints.
△ Less
Submitted 13 April, 2021; v1 submitted 24 March, 2020;
originally announced March 2020.
-
Deep Active Inference for Autonomous Robot Navigation
Authors:
Ozan Çatal,
Samuel Wauthier,
Tim Verbelen,
Cedric De Boom,
Bart Dhoedt
Abstract:
Active inference is a theory that underpins the way biological agent's perceive and act in the real world. At its core, active inference is based on the principle that the brain is an approximate Bayesian inference engine, building an internal generative model to drive agents towards minimal surprise. Although this theory has shown interesting results with grounding in cognitive neuroscience, its…
▽ More
Active inference is a theory that underpins the way biological agent's perceive and act in the real world. At its core, active inference is based on the principle that the brain is an approximate Bayesian inference engine, building an internal generative model to drive agents towards minimal surprise. Although this theory has shown interesting results with grounding in cognitive neuroscience, its application remains limited to simulations with small, predefined sensor and state spaces.
In this paper, we leverage recent advances in deep learning to build more complex generative models that can work without a predefined states space. State representations are learned end-to-end from real-world, high-dimensional sensory data such as camera frames. We also show that these generative models can be used to engage in active inference. To the best of our knowledge this is the first application of deep active inference for a real-world robot navigation task.
△ Less
Submitted 6 March, 2020;
originally announced March 2020.
-
Rhythm, Chord and Melody Generation for Lead Sheets using Recurrent Neural Networks
Authors:
Cedric De Boom,
Stephanie Van Laere,
Tim Verbelen,
Bart Dhoedt
Abstract:
Music that is generated by recurrent neural networks often lacks a sense of direction and coherence. We therefore propose a two-stage LSTM-based model for lead sheet generation, in which the harmonic and rhythmic templates of the song are produced first, after which, in a second stage, a sequence of melody notes is generated conditioned on these templates. A subjective listening test shows that ou…
▽ More
Music that is generated by recurrent neural networks often lacks a sense of direction and coherence. We therefore propose a two-stage LSTM-based model for lead sheet generation, in which the harmonic and rhythmic templates of the song are produced first, after which, in a second stage, a sequence of melody notes is generated conditioned on these templates. A subjective listening test shows that our approach outperforms the baselines and increases perceived musical coherence.
△ Less
Submitted 21 February, 2020;
originally announced February 2020.
-
Learning Perception and Planning with Deep Active Inference
Authors:
Ozan Çatal,
Tim Verbelen,
Johannes Nauta,
Cedric De Boom,
Bart Dhoedt
Abstract:
Active inference is a process theory of the brain that states that all living organisms infer actions in order to minimize their (expected) free energy. However, current experiments are limited to predefined, often discrete, state spaces. In this paper we use recent advances in deep learning to learn the state space and approximate the necessary probability distributions to engage in active infere…
▽ More
Active inference is a process theory of the brain that states that all living organisms infer actions in order to minimize their (expected) free energy. However, current experiments are limited to predefined, often discrete, state spaces. In this paper we use recent advances in deep learning to learn the state space and approximate the necessary probability distributions to engage in active inference.
△ Less
Submitted 24 February, 2020; v1 submitted 30 January, 2020;
originally announced January 2020.
-
Learning to Grasp from a Single Demonstration
Authors:
Pieter Van Molle,
Tim Verbelen,
Elias De Coninck,
Cedric De Boom,
Pieter Simoens,
Bart Dhoedt
Abstract:
Learning-based approaches for robotic gras** using visual sensors typically require collecting a large size dataset, either manually labeled or by many trial and errors of a robotic manipulator in the real or simulated world. We propose a simpler learning-from-demonstration approach that is able to detect the object to grasp from merely a single demonstration using a convolutional neural network…
▽ More
Learning-based approaches for robotic gras** using visual sensors typically require collecting a large size dataset, either manually labeled or by many trial and errors of a robotic manipulator in the real or simulated world. We propose a simpler learning-from-demonstration approach that is able to detect the object to grasp from merely a single demonstration using a convolutional neural network we call GraspNet. In order to increase robustness and decrease the training time even further, we leverage data from previous demonstrations to quickly fine-tune a GrapNet for each new demonstration. We present some preliminary results on a gras** experiment with the Franka Panda cobot for which we can train a GraspNet with only hundreds of train iterations.
△ Less
Submitted 9 June, 2018;
originally announced June 2018.
-
Character-level Recurrent Neural Networks in Practice: Comparing Training and Sampling Schemes
Authors:
Cedric De Boom,
Thomas Demeester,
Bart Dhoedt
Abstract:
Recurrent neural networks are nowadays successfully used in an abundance of applications, going from text, speech and image processing to recommender systems. Backpropagation through time is the algorithm that is commonly used to train these networks on specific tasks. Many deep learning frameworks have their own implementation of training and sampling procedures for recurrent neural networks, whi…
▽ More
Recurrent neural networks are nowadays successfully used in an abundance of applications, going from text, speech and image processing to recommender systems. Backpropagation through time is the algorithm that is commonly used to train these networks on specific tasks. Many deep learning frameworks have their own implementation of training and sampling procedures for recurrent neural networks, while there are in fact multiple other possibilities to choose from and other parameters to tune. In existing literature this is very often overlooked or ignored. In this paper we therefore give an overview of possible training and sampling schemes for character-level recurrent neural networks to solve the task of predicting the next token in a given sequence. We test these different schemes on a variety of datasets, neural network architectures and parameter settings, and formulate a number of take-home recommendations. The choice of training and sampling scheme turns out to be subject to a number of trade-offs, such as training stability, sampling time, model performance and implementation effort, but is largely independent of the data. Perhaps the most surprising result is that transferring hidden states for correctly initializing the model on subsequences often leads to unstable training behavior depending on the dataset.
△ Less
Submitted 9 January, 2018; v1 submitted 2 January, 2018;
originally announced January 2018.
-
Large-Scale User Modeling with Recurrent Neural Networks for Music Discovery on Multiple Time Scales
Authors:
Cedric De Boom,
Rohan Agrawal,
Samantha Hansen,
Esh Kumar,
Romain Yon,
Ching-Wei Chen,
Thomas Demeester,
Bart Dhoedt
Abstract:
The amount of content on online music streaming platforms is immense, and most users only access a tiny fraction of this content. Recommender systems are the application of choice to open up the collection to these users. Collaborative filtering has the disadvantage that it relies on explicit ratings, which are often unavailable, and generally disregards the temporal nature of music consumption. O…
▽ More
The amount of content on online music streaming platforms is immense, and most users only access a tiny fraction of this content. Recommender systems are the application of choice to open up the collection to these users. Collaborative filtering has the disadvantage that it relies on explicit ratings, which are often unavailable, and generally disregards the temporal nature of music consumption. On the other hand, item co-occurrence algorithms, such as the recently introduced word2vec-based recommenders, are typically left without an effective user representation. In this paper, we present a new approach to model users through recurrent neural networks by sequentially processing consumed items, represented by any type of embeddings and other context features. This way we obtain semantically rich user representations, which capture a user's musical taste over time. Our experimental analysis on large-scale user data shows that our model can be used to predict future songs a user will likely listen to, both in the short and long term.
△ Less
Submitted 22 August, 2017;
originally announced August 2017.
-
On a method for Rock Classification using Textural Features and Genetic Optimization
Authors:
Manuel Blanco Valentin,
Clecio Roque De Bom,
Marcio Portes de Albuquerque,
Marcelo Portes de Albuquerque,
Elisangela Faria,
Maury Duarte Correia,
Rodrigo Surmas
Abstract:
In this work we present a method to classify a set of rock textures based on a Spectral Analysis and the extraction of the texture Features of the resulted images. Up to 520 features were tested using 4 different filters and all 31 different combinations were verified. The classification process relies on a Naive Bayes classifier. We performed two kinds of optimizations: statistical optimization w…
▽ More
In this work we present a method to classify a set of rock textures based on a Spectral Analysis and the extraction of the texture Features of the resulted images. Up to 520 features were tested using 4 different filters and all 31 different combinations were verified. The classification process relies on a Naive Bayes classifier. We performed two kinds of optimizations: statistical optimization with covariance-based Principal Component Analysis (PCA) and a genetic optimization, for 10,000 randomly defined samples, achieving a final maximum classification success of 91% against the original 70% success ratio (without any optimization nor filters used). After the optimization 9 types of features emerged as most relevant.
△ Less
Submitted 17 August, 2017; v1 submitted 6 July, 2016;
originally announced July 2016.
-
Representation learning for very short texts using weighted word embedding aggregation
Authors:
Cedric De Boom,
Steven Van Canneyt,
Thomas Demeester,
Bart Dhoedt
Abstract:
Short text messages such as tweets are very noisy and sparse in their use of vocabulary. Traditional textual representations, such as tf-idf, have difficulty gras** the semantic meaning of such texts, which is important in applications such as event detection, opinion mining, news recommendation, etc. We constructed a method based on semantic word embeddings and frequency information to arrive a…
▽ More
Short text messages such as tweets are very noisy and sparse in their use of vocabulary. Traditional textual representations, such as tf-idf, have difficulty gras** the semantic meaning of such texts, which is important in applications such as event detection, opinion mining, news recommendation, etc. We constructed a method based on semantic word embeddings and frequency information to arrive at low-dimensional representations for short texts designed to capture semantic similarity. For this purpose we designed a weight-based model and a learning procedure based on a novel median-based loss function. This paper discusses the details of our model and the optimization methods, together with the experimental results on both Wikipedia and Twitter data. We find that our method outperforms the baseline approaches in the experiments, and that it generalizes well on different word embeddings without retraining. Our method is therefore capable of retaining most of the semantic information in the text, and is applicable out-of-the-box.
△ Less
Submitted 2 July, 2016;
originally announced July 2016.
-
Lazy Evaluation of Convolutional Filters
Authors:
Sam Leroux,
Steven Bohez,
Cedric De Boom,
Elias De Coninck,
Tim Verbelen,
Bert Vankeirsbilck,
Pieter Simoens,
Bart Dhoedt
Abstract:
In this paper we propose a technique which avoids the evaluation of certain convolutional filters in a deep neural network. This allows to trade-off the accuracy of a deep neural network with the computational and memory requirements. This is especially important on a constrained device unable to hold all the weights of the network in memory.
In this paper we propose a technique which avoids the evaluation of certain convolutional filters in a deep neural network. This allows to trade-off the accuracy of a deep neural network with the computational and memory requirements. This is especially important on a constrained device unable to hold all the weights of the network in memory.
△ Less
Submitted 27 May, 2016;
originally announced May 2016.
-
Efficiency Evaluation of Character-level RNN Training Schedules
Authors:
Cedric De Boom,
Sam Leroux,
Steven Bohez,
Pieter Simoens,
Thomas Demeester,
Bart Dhoedt
Abstract:
We present four training and prediction schedules from the same character-level recurrent neural network. The efficiency of these schedules is tested in terms of model effectiveness as a function of training time and amount of training data seen. We show that the choice of training and prediction schedule potentially has a considerable impact on the prediction effectiveness for a given training bu…
▽ More
We present four training and prediction schedules from the same character-level recurrent neural network. The efficiency of these schedules is tested in terms of model effectiveness as a function of training time and amount of training data seen. We show that the choice of training and prediction schedule potentially has a considerable impact on the prediction effectiveness for a given training budget.
△ Less
Submitted 9 May, 2016;
originally announced May 2016.
-
Observation and Confirmation of Six Strong Lensing Systems in The Dark Energy Survey Science Verification Data
Authors:
B. Nord,
E. Buckley-Geer,
H. Lin,
H. T. Diehl,
J. Helsby,
N. Kuropatkin,
A. Amara,
T. Collett,
S. Allam,
G. Caminha,
C. De Bom,
S. Desai,
H. Dúmet-Montoya,
M. Elidaiana da S. Pereira,
D. A. Finley,
B. Flaugher,
C. Furlanetto,
H. Gaitsch,
M. Gill,
K. W. Merritt,
A. More,
D. Tucker,
E. S. Rykoff,
E. Rozo,
F. B. Abdalla
, et al. (67 additional authors not shown)
Abstract:
We report the observation and confirmation of the first group- and cluster-scale strong gravitational lensing systems found in Dark Energy Survey (DES) data. Through visual inspection of data from the Science Verification (SV) season, we identified 53 candidate systems. We then obtained spectroscopic follow-up of 21 candidates using the Gemini Multi-Object Spectrograph (GMOS) at the Gemini South t…
▽ More
We report the observation and confirmation of the first group- and cluster-scale strong gravitational lensing systems found in Dark Energy Survey (DES) data. Through visual inspection of data from the Science Verification (SV) season, we identified 53 candidate systems. We then obtained spectroscopic follow-up of 21 candidates using the Gemini Multi-Object Spectrograph (GMOS) at the Gemini South telescope and the Inamori-Magellan Areal Camera and Spectrograph (IMACS) at the Magellan/Baade telescope. With this follow-up, we confirmed six candidates as gravitational lenses: Three of the systems are newly discovered, and the remaining three were previously known. Of the 21 observed candidates, the remaining 15 were either not detected in spectroscopic observations, were observed and did not exhibit continuum emission (or spectral features), or were ruled out as lensing systems. The confirmed sample consists of one group-scale and five galaxy cluster-scale lenses. The lensed sources range in redshift z ~ 0.80-3.2, and in i-band surface brightness i_{SB} ~ 23-25 mag/sq.-arcsec. (2" aperture). For each of the six systems, we estimate the Einstein radius and the enclosed mass, which have ranges ~ 5.0 - 8.6" and ~ 7.5 x 10^{12} - 6.4 x 10^{13} solar masses, respectively.
△ Less
Submitted 9 December, 2015;
originally announced December 2015.
-
Learning Semantic Similarity for Very Short Texts
Authors:
Cedric De Boom,
Steven Van Canneyt,
Steven Bohez,
Thomas Demeester,
Bart Dhoedt
Abstract:
Levering data on social media, such as Twitter and Facebook, requires information retrieval algorithms to become able to relate very short text fragments to each other. Traditional text similarity methods such as tf-idf cosine-similarity, based on word overlap, mostly fail to produce good results in this case, since word overlap is little or non-existent. Recently, distributed word representations…
▽ More
Levering data on social media, such as Twitter and Facebook, requires information retrieval algorithms to become able to relate very short text fragments to each other. Traditional text similarity methods such as tf-idf cosine-similarity, based on word overlap, mostly fail to produce good results in this case, since word overlap is little or non-existent. Recently, distributed word representations, or word embeddings, have been shown to successfully allow words to match on the semantic level. In order to pair short text fragments - as a concatenation of separate words - an adequate distributed sentence representation is needed, in existing literature often obtained by naively combining the individual word representations. We therefore investigated several text representations as a combination of word embeddings in the context of semantic pair matching. This paper investigates the effectiveness of several such naive techniques, as well as traditional tf-idf similarity, for fragments of different lengths. Our main contribution is a first step towards a hybrid method that combines the strength of dense distributed representations - as opposed to sparse term matching - with the strength of tf-idf based methods to automatically reduce the impact of less informative terms. Our new approach outperforms the existing techniques in a toy experimental set-up, leading to the conclusion that the combination of word embeddings and tf-idf information might lead to a better model for semantic content within very short text fragments.
△ Less
Submitted 2 December, 2015;
originally announced December 2015.
-
A simple prescription for simulating and characterizing gravitational arcs
Authors:
Cristina Furlanetto,
Basílio X. Santiago,
Martín Makler,
Clécio de Bom,
Carlos H. Brandt,
Angelo Fausti Neto,
Pedro C. Ferreira,
Luiz Nicolaci da Costa,
Marcio A. G. Maia
Abstract:
Simple models of gravitational arcs are crucial to simulate large samples of these objects with full control of the input parameters. These models also provide crude and automated estimates of the shape and structure of the arcs, which are necessary when trying to detect and characterize these objects on massive wide area imaging surveys. We here present and explore the ArcEllipse, a simple prescr…
▽ More
Simple models of gravitational arcs are crucial to simulate large samples of these objects with full control of the input parameters. These models also provide crude and automated estimates of the shape and structure of the arcs, which are necessary when trying to detect and characterize these objects on massive wide area imaging surveys. We here present and explore the ArcEllipse, a simple prescription to create objects with shape similar to gravitational arcs. We also present PaintArcs, which is a code that couples this geometrical form with a brightness distribution and adds the resulting object to images. Finally, we introduce ArcFitting, which is a tool that fits ArcEllipses to images of real gravitational arcs. We validate this fitting technique using simulated arcs and apply it to CFHTLS and HST images of tangential arcs around clusters of galaxies. Our simple ArcEllipse model for the arc, associated to a Sérsic profile for the source, recovers the total signal in real images typically within 10%-30%. The ArcEllipse+Sérsic models also automatically recover visual estimates of length-to-width ratios of real arcs. Residual maps between data and model images reveal the incidence of arc substructure. They may thus be used as a diagnostic for arcs formed by the merging of multiple images. The incidence of these substructures is the main factor preventing ArcEllipse models from accurately describing real lensed systems.
△ Less
Submitted 23 January, 2013; v1 submitted 12 November, 2012;
originally announced November 2012.