-
Creating Language-driven Spatial Variations of Icon Images
Authors:
Xianghao Xu,
Aditya Ganeshan,
Karl D. D. Willis,
Yewen Pu,
Daniel Ritchie
Abstract:
Editing 2D icon images can require significant manual effort from designers. It involves manipulating multiple geometries while maintaining the logical or physical coherence of the objects depicted in the image. Previous language driven image editing methods can change the texture and geometry of objects in the image but fail at producing spatial variations, i.e. modifying spatial relations betwee…
▽ More
Editing 2D icon images can require significant manual effort from designers. It involves manipulating multiple geometries while maintaining the logical or physical coherence of the objects depicted in the image. Previous language driven image editing methods can change the texture and geometry of objects in the image but fail at producing spatial variations, i.e. modifying spatial relations between objects while maintaining their identities. We present a language driven editing method that can produce spatial variations of icon images. Our method takes in an icon image along with a user's editing request text prompt and outputs an edited icon image reflecting the user's editing request. Our method is designed based on two key observations: (1) A user's editing requests can be translated by a large language model (LLM), with help from a domain specific language (DSL) library, into to a set of geometrical constraints defining the relationships between segments in an icon image. (2) Optimizing the affine transformations of the segments with respect to these geometrical constraints can produce icon images that fulfill the editing request and preserve overall physical and logical coherence. Quantitative and qualitative results show that our system outperforms multiple baselines, enabling natural editing of icon images.
△ Less
Submitted 29 May, 2024;
originally announced May 2024.
-
BrepGen: A B-rep Generative Diffusion Model with Structured Latent Geometry
Authors:
Xiang Xu,
Joseph G. Lambourne,
Pradeep Kumar Jayaraman,
Zhengqing Wang,
Karl D. D. Willis,
Yasutaka Furukawa
Abstract:
This paper presents BrepGen, a diffusion-based generative approach that directly outputs a Boundary representation (B-rep) Computer-Aided Design (CAD) model. BrepGen represents a B-rep model as a novel structured latent geometry in a hierarchical tree. With the root node representing a whole CAD solid, each element of a B-rep model (i.e., a face, an edge, or a vertex) progressively turns into a ch…
▽ More
This paper presents BrepGen, a diffusion-based generative approach that directly outputs a Boundary representation (B-rep) Computer-Aided Design (CAD) model. BrepGen represents a B-rep model as a novel structured latent geometry in a hierarchical tree. With the root node representing a whole CAD solid, each element of a B-rep model (i.e., a face, an edge, or a vertex) progressively turns into a child-node from top to bottom. B-rep geometry information goes into the nodes as the global bounding box of each primitive along with a latent code describing the local geometric shape. The B-rep topology information is implicitly represented by node duplication. When two faces share an edge, the edge curve will appear twice in the tree, and a T-junction vertex with three incident edges appears six times in the tree with identical node features. Starting from the root and progressing to the leaf, BrepGen employs Transformer-based diffusion models to sequentially denoise node features while duplicated nodes are detected and merged, recovering the B-Rep topology information. Extensive experiments show that BrepGen advances the task of CAD B-rep generation, surpassing existing methods on various benchmarks. Results on our newly collected furniture dataset further showcase its exceptional capability in generating complicated geometry. While previous methods were limited to generating simple prismatic shapes, BrepGen incorporates free-form and doubly-curved surfaces for the first time. Additional applications of BrepGen include CAD autocomplete and design interpolation. The code, pretrained models, and dataset are available at https://github.com/samxuxiang/BrepGen.
△ Less
Submitted 16 May, 2024; v1 submitted 27 January, 2024;
originally announced January 2024.
-
Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System
Authors:
Kaitlyn Wang,
Jian Ge,
Kevin Willis,
Kevin Wang,
Yinan Zhao
Abstract:
Since the discovery of the first hot Jupiter orbiting a solar-type star, 51 Peg, in 1995, more than 4000 exoplanets have been identified using various observational techniques. The formation process of these sub-Earths remains elusive, and acquiring additional samples is essential for investigating this unique population. In our study, we employ a novel GPU Phase Folding algorithm combined with a…
▽ More
Since the discovery of the first hot Jupiter orbiting a solar-type star, 51 Peg, in 1995, more than 4000 exoplanets have been identified using various observational techniques. The formation process of these sub-Earths remains elusive, and acquiring additional samples is essential for investigating this unique population. In our study, we employ a novel GPU Phase Folding algorithm combined with a Convolutional Neural Network, termed the GPFC method, on Kepler photometry data. This method enhances the transit search speed significantly over the traditional Box-fitting Least Squares method, allowing a complete search of the known KOI photometry data within hours using a commercial GPU card. To date, we have identified five promising sub-Earth short-period candidates: K00446.c, K01821.b, K01522.c, K03404.b, and K04978.b. A closer analysis reveals the following characteristics: K00446.c orbits a K dwarf on a 0.645091-day period. With a radius of $0.461R_\oplus$, it ranks as the second smallest USP discovered to date. K01821.b is a sub-Earth with a radius of $0.648R_\oplus$, orbiting a G dwarf over a 0.91978-day period. It is the second smallest USP among all confirmed USPs orbiting G dwarfs in the NASA Archive. K01522.c has a radius of $0.704 R_\oplus$ and completes an orbit around a Sun-like G dwarf in 0.64672 days; K03404.b, with a radius of $0.738 R_\oplus$, orbits a G dwarf on a 0.68074-day period; and K04978.b, with its planetary radius of $0.912 R_\oplus$, orbits a G dwarf, completing an orbit every 0.94197 days. Three of our finds, K01821.b, K01522.c and K03404.b, rank as the smallest planets among all confirmed USPs orbiting G dwarfs in the Kepler dataset. The discovery of these small exoplanets underscores the promising capability of the GPFC method for searching for small, new transiting exoplanets in photometry data from Kepler, TESS, and future space transit missions.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
The GPU Phase Folding and Deep Learning Method for Detecting Exoplanet Transits
Authors:
Kaitlyn Wang,
Jian Ge,
Kevin Willis,
Kevin Wang,
Yinan Zhao
Abstract:
This paper presents GPFC, a novel Graphics Processing Unit (GPU) Phase Folding and Convolutional Neural Network (CNN) system to detect exoplanets using the transit method. We devise a fast folding algorithm parallelized on a GPU to amplify low signal-to-noise ratio transit signals, allowing a search at high precision and speed. A CNN trained on two million synthetic light curves reports a score in…
▽ More
This paper presents GPFC, a novel Graphics Processing Unit (GPU) Phase Folding and Convolutional Neural Network (CNN) system to detect exoplanets using the transit method. We devise a fast folding algorithm parallelized on a GPU to amplify low signal-to-noise ratio transit signals, allowing a search at high precision and speed. A CNN trained on two million synthetic light curves reports a score indicating the likelihood of a planetary signal at each period. While the GPFC method has broad applicability across period ranges, this research specifically focuses on detecting ultra-short-period planets with orbital periods less than one day. GPFC improves on speed by three orders of magnitude over the predominant Box-fitting Least Squares (BLS) method. Our simulation results show GPFC achieves $97%$ training accuracy, higher true positive rate at the same false positive rate of detection, and higher precision at the same recall rate when compared to BLS. GPFC recovers $100\%$ of known ultra-short-period planets in $\textit{Kepler}$ light curves from a blind search. These results highlight the promise of GPFC as an alternative approach to the traditional BLS algorithm for finding new transiting exoplanets in data taken with $\textit{Kepler}$ and other space transit missions such as K2, TESS and future PLATO and Earth 2.0.
△ Less
Submitted 21 January, 2024; v1 submitted 4 December, 2023;
originally announced December 2023.
-
ASAP: Automated Sequence Planning for Complex Robotic Assembly with Physical Feasibility
Authors:
Yunsheng Tian,
Karl D. D. Willis,
Bassel Al Omari,
Jieliang Luo,
**chuan Ma,
Yichen Li,
Farhad Javid,
Edward Gu,
Joshua Jacob,
Shinjiro Sueda,
Hui Li,
Sachin Chitta,
Wojciech Matusik
Abstract:
The automated assembly of complex products requires a system that can automatically plan a physically feasible sequence of actions for assembling many parts together. In this paper, we present ASAP, a physics-based planning approach for automatically generating such a sequence for general-shaped assemblies. ASAP accounts for gravity to design a sequence where each sub-assembly is physically stable…
▽ More
The automated assembly of complex products requires a system that can automatically plan a physically feasible sequence of actions for assembling many parts together. In this paper, we present ASAP, a physics-based planning approach for automatically generating such a sequence for general-shaped assemblies. ASAP accounts for gravity to design a sequence where each sub-assembly is physically stable with a limited number of parts being held and a support surface. We apply efficient tree search algorithms to reduce the combinatorial complexity of determining such an assembly sequence. The search can be guided by either geometric heuristics or graph neural networks trained on data with simulation labels. Finally, we show the superior performance of ASAP at generating physically realistic assembly sequence plans on a large dataset of hundreds of complex product assemblies. We further demonstrate the applicability of ASAP on both simulation and real-world robotic setups. Project website: asap.csail.mit.edu
△ Less
Submitted 29 February, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
TExplain: Explaining Learned Visual Features via Pre-trained (Frozen) Language Models
Authors:
Saeid Asgari Taghanaki,
Aliasghar Khani,
Ali Saheb Pasand,
Amir Khasahmadi,
Aditya Sanghi,
Karl D. D. Willis,
Ali Mahdavi-Amiri
Abstract:
Interpreting the learned features of vision models has posed a longstanding challenge in the field of machine learning. To address this issue, we propose a novel method that leverages the capabilities of language models to interpret the learned features of pre-trained image classifiers. Our method, called TExplain, tackles this task by training a neural network to establish a connection between th…
▽ More
Interpreting the learned features of vision models has posed a longstanding challenge in the field of machine learning. To address this issue, we propose a novel method that leverages the capabilities of language models to interpret the learned features of pre-trained image classifiers. Our method, called TExplain, tackles this task by training a neural network to establish a connection between the feature space of image classifiers and language models. Then, during inference, our approach generates a vast number of sentences to explain the features learned by the classifier for a given image. These sentences are then used to extract the most frequent words, providing a comprehensive understanding of the learned features and patterns within the classifier. Our method, for the first time, utilizes these frequent words corresponding to a visual representation to provide insights into the decision-making process of the independently trained classifier, enabling the detection of spurious correlations, biases, and a deeper comprehension of its behavior. To validate the effectiveness of our approach, we conduct experiments on diverse datasets, including ImageNet-9L and Waterbirds. The results demonstrate the potential of our method to enhance the interpretability and robustness of image classifiers.
△ Less
Submitted 1 May, 2024; v1 submitted 1 September, 2023;
originally announced September 2023.
-
Hierarchical Neural Coding for Controllable CAD Model Generation
Authors:
Xiang Xu,
Pradeep Kumar Jayaraman,
Joseph G. Lambourne,
Karl D. D. Willis,
Yasutaka Furukawa
Abstract:
This paper presents a novel generative model for Computer Aided Design (CAD) that 1) represents high-level design concepts of a CAD model as a three-level hierarchical tree of neural codes, from global part arrangement down to local curve geometry; and 2) controls the generation or completion of CAD models by specifying the target design using a code tree. Concretely, a novel variant of a vector q…
▽ More
This paper presents a novel generative model for Computer Aided Design (CAD) that 1) represents high-level design concepts of a CAD model as a three-level hierarchical tree of neural codes, from global part arrangement down to local curve geometry; and 2) controls the generation or completion of CAD models by specifying the target design using a code tree. Concretely, a novel variant of a vector quantized VAE with "masked skip connection" extracts design variations as neural codebooks at three levels. Two-stage cascaded auto-regressive transformers learn to generate code trees from incomplete CAD models and then complete CAD models following the intended design. Extensive experiments demonstrate superior performance on conventional tasks such as random generation while enabling novel interaction capabilities on conditional generation tasks. The code is available at https://github.com/samxuxiang/hnc-cad.
△ Less
Submitted 30 June, 2023;
originally announced July 2023.
-
Next Steps for Human-Centered Generative AI: A Technical Perspective
Authors:
Xiang 'Anthony' Chen,
Jeff Burke,
Ruofei Du,
Matthew K. Hong,
Jennifer Jacobs,
Philippe Laban,
Dingzeyu Li,
Nanyun Peng,
Karl D. D. Willis,
Chien-Sheng Wu,
Bolei Zhou
Abstract:
Through iterative, cross-disciplinary discussions, we define and propose next-steps for Human-centered Generative AI (HGAI). We contribute a comprehensive research agenda that lays out future directions of Generative AI spanning three levels: aligning with human values; assimilating human intents; and augmenting human abilities. By identifying these next-steps, we intend to draw interdisciplinary…
▽ More
Through iterative, cross-disciplinary discussions, we define and propose next-steps for Human-centered Generative AI (HGAI). We contribute a comprehensive research agenda that lays out future directions of Generative AI spanning three levels: aligning with human values; assimilating human intents; and augmenting human abilities. By identifying these next-steps, we intend to draw interdisciplinary research teams to pursue a coherent set of emergent ideas in HGAI, focusing on their interested topics while maintaining a coherent big picture of the future work landscape.
△ Less
Submitted 22 December, 2023; v1 submitted 27 June, 2023;
originally announced June 2023.
-
Neurosymbolic Models for Computer Graphics
Authors:
Daniel Ritchie,
Paul Guerrero,
R. Kenny Jones,
Niloy J. Mitra,
Adriana Schulz,
Karl D. D. Willis,
Jiajun Wu
Abstract:
Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters, stochastic variations, high-quality outputs, compact representation, and more. But they also have some limitations, such as the difficulty of authoring a procedural…
▽ More
Procedural models (i.e. symbolic programs that output visual data) are a historically-popular method for representing graphics content: vegetation, buildings, textures, etc. They offer many advantages: interpretable design parameters, stochastic variations, high-quality outputs, compact representation, and more. But they also have some limitations, such as the difficulty of authoring a procedural model from scratch. More recently, AI-based methods, and especially neural networks, have become popular for creating graphic content. These techniques allow users to directly specify desired properties of the artifact they want to create (via examples, constraints, or objectives), while a search, optimization, or learning algorithm takes care of the details. However, this ease of use comes at a cost, as it's often hard to interpret or manipulate these representations. In this state-of-the-art report, we summarize research on neurosymbolic models in computer graphics: methods that combine the strengths of both AI and symbolic programs to represent, generate, and manipulate visual data. We survey recent work applying these techniques to represent 2D shapes, 3D shapes, and materials & textures. Along the way, we situate each prior work in a unified design space for neurosymbolic models, which helps reveal underexplored areas and opportunities for future research.
△ Less
Submitted 20 April, 2023;
originally announced April 2023.
-
Newly discovered Ca II absorbers in the early universe: statistics, element abundances and dust
Authors:
Hannah Fang,
Iona Xia,
Jian Ge,
Kevin Willis,
Yinan Zhao
Abstract:
We report discoveries of 165 new quasar Ca II absorbers from the Sloan Digital Sky Survey (SDSS) Data Release 7 and 12. Our Ca II rest frame equivalent width distribution supports the weak and strong subpopulations, split at ${W}^{\lambda3934}_{0}=0.7$Å. Comparison of both populations' dust depletion shows clear consistency for weak absorber association with halo-type gas in the Milky Way (MW) whi…
▽ More
We report discoveries of 165 new quasar Ca II absorbers from the Sloan Digital Sky Survey (SDSS) Data Release 7 and 12. Our Ca II rest frame equivalent width distribution supports the weak and strong subpopulations, split at ${W}^{\lambda3934}_{0}=0.7$Å. Comparison of both populations' dust depletion shows clear consistency for weak absorber association with halo-type gas in the Milky Way (MW) while strong absorbers have environments consistent with halo and disc-type gas. We probed our high redshift Ca II absorbers for 2175Å dust bumps, discovering 12 2175Å dust absorbers (2DAs). This clearly shows that some Ca II absorbers follow the Large Magellanic Cloud (LMC) extinction law rather than the Small Magellanic Cloud extinction law. About 33% of our strong Ca II absorbers exhibit the 2175Å dust bump while only 6% of weak Ca II absorbers show this bump. 2DA detection further supports the theory that strong Ca II absorbers are associated with disk components and are dustier than the weak population. Comparing average Ca II absorber dust depletion patterns to that of Damped Lyα Absorbers (DLAs), Mg II absorbers, and 2DAs shows that Ca II absorbers generally have environments with more dust than DLAs and Mg II absorbers, but less dust than 2DAs. Comparing 2175Å dust bump strengths from different samples and also the MW and LMC, the bump strength appears to grow stronger as the redshift decreases, indicating dust growth and the global chemical enrichment of galaxies in the universe over time.
△ Less
Submitted 23 November, 2022;
originally announced November 2022.
-
Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly
Authors:
Yunsheng Tian,
Jie Xu,
Yichen Li,
Jieliang Luo,
Shinjiro Sueda,
Hui Li,
Karl D. D. Willis,
Wojciech Matusik
Abstract:
Assembly planning is the core of automating product assembly, maintenance, and recycling for modern industrial manufacturing. Despite its importance and long history of research, planning for mechanical assemblies when given the final assembled state remains a challenging problem. This is due to the complexity of dealing with arbitrary 3D shapes and the highly constrained motion required for real-…
▽ More
Assembly planning is the core of automating product assembly, maintenance, and recycling for modern industrial manufacturing. Despite its importance and long history of research, planning for mechanical assemblies when given the final assembled state remains a challenging problem. This is due to the complexity of dealing with arbitrary 3D shapes and the highly constrained motion required for real-world assemblies. In this work, we propose a novel method to efficiently plan physically plausible assembly motion and sequences for real-world assemblies. Our method leverages the assembly-by-disassembly principle and physics-based simulation to efficiently explore a reduced search space. To evaluate the generality of our method, we define a large-scale dataset consisting of thousands of physically valid industrial assemblies with a variety of assembly motions required. Our experiments on this new benchmark demonstrate we achieve a state-of-the-art success rate and the highest computational efficiency compared to other baseline algorithms. Our method also generalizes to rotational assemblies (e.g., screws and puzzles) and solves 80-part assemblies within several minutes.
△ Less
Submitted 7 November, 2022;
originally announced November 2022.
-
Data Models for Dataset Drift Controls in Machine Learning With Optical Images
Authors:
Luis Oala,
Marco Aversa,
Gabriel Nobis,
Kurt Willis,
Yoan Neuenschwander,
Michèle Buck,
Christian Matek,
Jerome Extermann,
Enrico Pomarico,
Wojciech Samek,
Roderick Murray-Smith,
Christoph Clausen,
Bruno Sanguinetti
Abstract:
Camera images are ubiquitous in machine learning research. They also play a central role in the delivery of important services spanning medicine and environmental surveying. However, the application of machine learning models in these domains has been limited because of robustness concerns. A primary failure mode are performance drops due to differences between the training and deployment data. Wh…
▽ More
Camera images are ubiquitous in machine learning research. They also play a central role in the delivery of important services spanning medicine and environmental surveying. However, the application of machine learning models in these domains has been limited because of robustness concerns. A primary failure mode are performance drops due to differences between the training and deployment data. While there are methods to prospectively validate the robustness of machine learning models to such dataset drifts, existing approaches do not account for explicit models of the primary object of interest: the data. This limits our ability to study and understand the relationship between data generation and downstream machine learning model performance in a physically accurate manner. In this study, we demonstrate how to overcome this limitation by pairing traditional machine learning with physical optics to obtain explicit and differentiable data models. We demonstrate how such data models can be constructed for image data and used to control downstream machine learning model performance related to dataset drift. The findings are distilled into three applications. First, drift synthesis enables the controlled generation of physically faithful drift test cases to power model selection and targeted generalization. Second, the gradient connection between machine learning task model and data model allows advanced, precise tolerancing of task model sensitivity to changes in the data generation. These drift forensics can be used to precisely specify the acceptable data environments in which a task model may be run. Third, drift optimization opens up the possibility to create drifts that can help the task model learn better faster, effectively optimizing the data generating process itself. A guide to access the open code and datasets is available at https://github.com/aiaudit-org/raw2logit.
△ Less
Submitted 7 May, 2023; v1 submitted 4 November, 2022;
originally announced November 2022.
-
CLIP-Sculptor: Zero-Shot Generation of High-Fidelity and Diverse Shapes from Natural Language
Authors:
Aditya Sanghi,
Rao Fu,
Vivian Liu,
Karl Willis,
Hooman Shayani,
Amir Hosein Khasahmadi,
Srinath Sridhar,
Daniel Ritchie
Abstract:
Recent works have demonstrated that natural language can be used to generate and edit 3D shapes. However, these methods generate shapes with limited fidelity and diversity. We introduce CLIP-Sculptor, a method to address these constraints by producing high-fidelity and diverse 3D shapes without the need for (text, shape) pairs during training. CLIP-Sculptor achieves this in a multi-resolution appr…
▽ More
Recent works have demonstrated that natural language can be used to generate and edit 3D shapes. However, these methods generate shapes with limited fidelity and diversity. We introduce CLIP-Sculptor, a method to address these constraints by producing high-fidelity and diverse 3D shapes without the need for (text, shape) pairs during training. CLIP-Sculptor achieves this in a multi-resolution approach that first generates in a low-dimensional latent space and then upscales to a higher resolution for improved shape fidelity. For improved shape diversity, we use a discrete latent space which is modeled using a transformer conditioned on CLIP's image-text embedding space. We also present a novel variant of classifier-free guidance, which improves the accuracy-diversity trade-off. Finally, we perform extensive experiments demonstrating that CLIP-Sculptor outperforms state-of-the-art baselines. The code is available at https://ivl.cs.brown.edu/#/projects/clip-sculptor.
△ Less
Submitted 24 May, 2023; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Discovering Ca II Absorption Lines With a Neural Network
Authors:
Iona Xia,
Jian Ge,
Kevin Willis,
Yinan Zhao
Abstract:
Quasar absorption line analysis is critical for studying gas and dust components and their physical and chemical properties as well as the evolution and formation of galaxies in the early universe. Ca II absorbers, which are one of the dustiest absorbers and are located at lower redshifts than most other absorbers, are especially valuable when studying physical processes and conditions in recent g…
▽ More
Quasar absorption line analysis is critical for studying gas and dust components and their physical and chemical properties as well as the evolution and formation of galaxies in the early universe. Ca II absorbers, which are one of the dustiest absorbers and are located at lower redshifts than most other absorbers, are especially valuable when studying physical processes and conditions in recent galaxies. However, the number of known quasar Ca II absorbers is relatively low due to the difficulty of detecting them with traditional methods. In this work, we developed an accurate and quick approach to search for Ca II absorption lines using deep learning. In our deep learning model, a convolutional neural network, tuned using simulated data, is used for the classification task. The simulated training data are generated by inserting artificial Ca II absorption lines into original quasar spectra from the Sloan Digital Sky Survey (SDSS) whilst an existing Ca II catalog is adopted as the test set. The resulting model achieves an accuracy of 96% on the real data in the test set. Our solution runs thousands of times faster than traditional methods, taking a fraction of a second to analyze thousands of quasars while traditional methods may take days to weeks. The trained neural network is applied to quasar spectra from SDSS's DR7 and DR12 and discovered 399 new quasar Ca II absorbers. In addition, we confirmed 409 known quasar Ca II absorbers identified previously by other research groups through traditional methods.
△ Less
Submitted 8 October, 2022;
originally announced October 2022.
-
Reconstructing editable prismatic CAD from rounded voxel models
Authors:
Joseph G. Lambourne,
Karl D. D. Willis,
Pradeep Kumar Jayaraman,
Longfei Zhang,
Aditya Sanghi,
Kamal Rahimi Malekshan
Abstract:
Reverse Engineering a CAD shape from other representations is an important geometric processing step for many downstream applications. In this work, we introduce a novel neural network architecture to solve this challenging task and approximate a smoothed signed distance function with an editable, constrained, prismatic CAD model. During training, our method reconstructs the input geometry in the…
▽ More
Reverse Engineering a CAD shape from other representations is an important geometric processing step for many downstream applications. In this work, we introduce a novel neural network architecture to solve this challenging task and approximate a smoothed signed distance function with an editable, constrained, prismatic CAD model. During training, our method reconstructs the input geometry in the voxel space by decomposing the shape into a series of 2D profile images and 1D envelope functions. These can then be recombined in a differentiable way allowing a geometric loss function to be defined. During inference, we obtain the CAD data by first searching a database of 2D constrained sketches to find curves which approximate the profile images, then extrude them and use Boolean operations to build the final CAD model. Our method approximates the target shape more closely than other methods and outputs highly editable constrained parametric sketches which are compatible with existing CAD software.
△ Less
Submitted 2 September, 2022;
originally announced September 2022.
-
Discovering Faint and High Apparent Motion Rate Near-Earth Asteroids Using A Deep Learning Program
Authors:
Franklin Wang,
Jian Ge,
Kevin Willis
Abstract:
Although many near-Earth objects have been found by ground-based telescopes, some fast-moving ones, especially those near detection limits, have been missed by observatories. We developed a convolutional neural network for detecting faint fast-moving near-Earth objects. It was trained with artificial streaks generated from simulations and was able to find these asteroid streaks with an accuracy of…
▽ More
Although many near-Earth objects have been found by ground-based telescopes, some fast-moving ones, especially those near detection limits, have been missed by observatories. We developed a convolutional neural network for detecting faint fast-moving near-Earth objects. It was trained with artificial streaks generated from simulations and was able to find these asteroid streaks with an accuracy of 98.7% and a false positive rate of 0.02% on simulated data. This program was used to search image data from the Zwicky Transient Facility (ZTF) in four nights in 2019, and it identified six previously undiscovered asteroids. The visual magnitudes of our detections range from ~19.0 - 20.3 and motion rates range from ~6.8 - 24 deg/day, which is very faint compared to other ZTF detections moving at similar motion rates. Our asteroids are also ~1 - 51 m diameter in size and ~5 - 60 lunar distances away at close approach, assuming their albedo values follow the albedo distribution function of known asteroids. The use of a purely simulated dataset to train our model enables the program to gain sensitivity in detecting faint and fast-moving objects while still being able to recover nearly all discoveries made by previously designed neural networks which used real detections to train neural networks. Our approach can be adopted by any observatory for detecting fast-moving asteroid streaks.
△ Less
Submitted 18 August, 2022;
originally announced August 2022.
-
Mates2Motion: Learning How Mechanical CAD Assemblies Work
Authors:
James Noeckel,
Benjamin T. Jones,
Karl Willis,
Brian Curless,
Adriana Schulz
Abstract:
We describe our work on inferring the degrees of freedom between mated parts in mechanical assemblies using deep learning on CAD representations. We train our model using a large dataset of real-world mechanical assemblies consisting of CAD parts and mates joining them together. We present methods for re-defining these mates to make them better reflect the motion of the assembly, as well as narrow…
▽ More
We describe our work on inferring the degrees of freedom between mated parts in mechanical assemblies using deep learning on CAD representations. We train our model using a large dataset of real-world mechanical assemblies consisting of CAD parts and mates joining them together. We present methods for re-defining these mates to make them better reflect the motion of the assembly, as well as narrowing down the possible axes of motion. We also conduct a user study to create a motion-annotated test set with more reliable labels.
△ Less
Submitted 4 May, 2023; v1 submitted 2 August, 2022;
originally announced August 2022.
-
SimCURL: Simple Contrastive User Representation Learning from Command Sequences
Authors:
Hang Chu,
Amir Hosein Khasahmadi,
Karl D. D. Willis,
Fraser Anderson,
Yaoli Mao,
Linh Tran,
Justin Matejka,
Jo Vermeulen
Abstract:
User modeling is crucial to understanding user behavior and essential for improving user experience and personalized recommendations. When users interact with software, vast amounts of command sequences are generated through logging and analytics systems. These command sequences contain clues to the users' goals and intents. However, these data modalities are highly unstructured and unlabeled, mak…
▽ More
User modeling is crucial to understanding user behavior and essential for improving user experience and personalized recommendations. When users interact with software, vast amounts of command sequences are generated through logging and analytics systems. These command sequences contain clues to the users' goals and intents. However, these data modalities are highly unstructured and unlabeled, making it difficult for standard predictive systems to learn from. We propose SimCURL, a simple yet effective contrastive self-supervised deep learning framework that learns user representation from unlabeled command sequences. Our method introduces a user-session network architecture, as well as session dropout as a novel way of data augmentation. We train and evaluate our method on a real-world command sequence dataset of more than half a billion commands. Our method shows significant improvement over existing methods when the learned representation is transferred to downstream tasks such as experience and expertise classification.
△ Less
Submitted 29 July, 2022;
originally announced July 2022.
-
SkexGen: Autoregressive Generation of CAD Construction Sequences with Disentangled Codebooks
Authors:
Xiang Xu,
Karl D. D. Willis,
Joseph G. Lambourne,
Chin-Yi Cheng,
Pradeep Kumar Jayaraman,
Yasutaka Furukawa
Abstract:
We present SkexGen, a novel autoregressive generative model for computer-aided design (CAD) construction sequences containing sketch-and-extrude modeling operations. Our model utilizes distinct Transformer architectures to encode topological, geometric, and extrusion variations of construction sequences into disentangled codebooks. Autoregressive Transformer decoders generate CAD construction sequ…
▽ More
We present SkexGen, a novel autoregressive generative model for computer-aided design (CAD) construction sequences containing sketch-and-extrude modeling operations. Our model utilizes distinct Transformer architectures to encode topological, geometric, and extrusion variations of construction sequences into disentangled codebooks. Autoregressive Transformer decoders generate CAD construction sequences sharing certain properties specified by the codebook vectors. Extensive experiments demonstrate that our disentangled codebook representation generates diverse and high-quality CAD models, enhances user control, and enables efficient exploration of the design space. The code is available at https://samxuxiang.github.io/skexgen.
△ Less
Submitted 11 July, 2022;
originally announced July 2022.
-
ET White Paper: To Find the First Earth 2.0
Authors:
Jian Ge,
Hui Zhang,
Weicheng Zang,
Hong** Deng,
Shude Mao,
Ji-Wei Xie,
Hui-Gen Liu,
Ji-Lin Zhou,
Kevin Willis,
Chelsea Huang,
Steve B. Howell,
Fabo Feng,
Jiapeng Zhu,
Xinyu Yao,
Beibei Liu,
Masataka Aizawa,
Wei Zhu,
Ya-** Li,
Bo Ma,
Quanzhi Ye,
Jie Yu,
Maosheng Xiang,
Cong Yu,
Shangfei Liu,
Ming Yang
, et al. (142 additional authors not shown)
Abstract:
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500…
▽ More
We propose to develop a wide-field and ultra-high-precision photometric survey mission, temporarily named "Earth 2.0 (ET)". This mission is designed to measure, for the first time, the occurrence rate and the orbital distributions of Earth-sized planets. ET consists of seven 30cm telescopes, to be launched to the Earth-Sun's L2 point. Six of these are transit telescopes with a field of view of 500 square degrees. Staring in the direction that encompasses the original Kepler field for four continuous years, this monitoring will return tens of thousands of transiting planets, including the elusive Earth twins orbiting solar-type stars. The seventh telescope is a 30cm microlensing telescope that will monitor an area of 4 square degrees toward the galactic bulge. This, combined with simultaneous ground-based KMTNet observations, will measure masses for hundreds of long-period and free-floating planets. Together, the transit and the microlensing telescopes will revolutionize our understandings of terrestrial planets across a large swath of orbital distances and free space. In addition, the survey data will also facilitate studies in the fields of asteroseismology, Galactic archeology, time-domain sciences, and black holes in binaries.
△ Less
Submitted 14 June, 2022;
originally announced June 2022.
-
SolidGen: An Autoregressive Model for Direct B-rep Synthesis
Authors:
Pradeep Kumar Jayaraman,
Joseph G. Lambourne,
Nishkrit Desai,
Karl D. D. Willis,
Aditya Sanghi,
Nigel J. W. Morris
Abstract:
The Boundary representation (B-rep) format is the de-facto shape representation in computer-aided design (CAD) to model solid and sheet objects. Recent approaches to generating CAD models have focused on learning sketch-and-extrude modeling sequences that are executed by a solid modeling kernel in postprocess to recover a B-rep. In this paper we present a new approach that enables learning from an…
▽ More
The Boundary representation (B-rep) format is the de-facto shape representation in computer-aided design (CAD) to model solid and sheet objects. Recent approaches to generating CAD models have focused on learning sketch-and-extrude modeling sequences that are executed by a solid modeling kernel in postprocess to recover a B-rep. In this paper we present a new approach that enables learning from and synthesizing B-reps without the need for supervision through CAD modeling sequence data. Our method SolidGen, is an autoregressive neural network that models the B-rep directly by predicting the vertices, edges, and faces using Transformer-based and pointer neural networks. Key to achieving this is our Indexed Boundary Representation that references B-rep vertices, edges and faces in a well-defined hierarchy to capture the geometric and topological relations suitable for use with machine learning. SolidGen can be easily conditioned on contexts e.g., class labels, images, and voxels thanks to its probabilistic modeling of the B-rep distribution. We demonstrate qualitatively, quantitatively, and through perceptual evaluation by human subjects that SolidGen can produce high quality, realistic CAD models.
△ Less
Submitted 20 February, 2023; v1 submitted 25 March, 2022;
originally announced March 2022.
-
JoinABLe: Learning Bottom-up Assembly of Parametric CAD Joints
Authors:
Karl D. D. Willis,
Pradeep Kumar Jayaraman,
Hang Chu,
Yunsheng Tian,
Yifei Li,
Daniele Grandi,
Aditya Sanghi,
Linh Tran,
Joseph G. Lambourne,
Armando Solar-Lezama,
Wojciech Matusik
Abstract:
Physical products are often complex assemblies combining a multitude of 3D parts modeled in computer-aided design (CAD) software. CAD designers build up these assemblies by aligning individual parts to one another using constraints called joints. In this paper we introduce JoinABLe, a learning-based method that assembles parts together to form joints. JoinABLe uses the weak supervision available i…
▽ More
Physical products are often complex assemblies combining a multitude of 3D parts modeled in computer-aided design (CAD) software. CAD designers build up these assemblies by aligning individual parts to one another using constraints called joints. In this paper we introduce JoinABLe, a learning-based method that assembles parts together to form joints. JoinABLe uses the weak supervision available in standard parametric CAD files without the help of object class labels or human guidance. Our results show that by making network predictions over a graph representation of solid models we can outperform multiple baseline methods with an accuracy (79.53%) that approaches human performance (80%). Finally, to support future research we release the Fusion 360 Gallery assembly dataset, containing assemblies with rich information on joints, contact surfaces, holes, and the underlying assembly graph structure.
△ Less
Submitted 22 April, 2022; v1 submitted 24 November, 2021;
originally announced November 2021.
-
HumBugDB: A Large-scale Acoustic Mosquito Dataset
Authors:
Ivan Kiskin,
Marianne Sinka,
Adam D. Cobb,
Waqas Rafique,
Lawrence Wang,
Davide Zilli,
Benjamin Gutteridge,
Rinita Dam,
Theodoros Marinos,
Yunpeng Li,
Dickson Msaky,
Emmanuel Kaindoa,
Gerard Killeen,
Eva Herreros-Moya,
Kathy J. Willis,
Stephen J. Roberts
Abstract:
This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and y…
▽ More
This paper presents the first large-scale multi-species dataset of acoustic recordings of mosquitoes tracked continuously in free flight. We present 20 hours of audio recordings that we have expertly labelled and tagged precisely in time. Significantly, 18 hours of recordings contain annotations from 36 different species. Mosquitoes are well-known carriers of diseases such as malaria, dengue and yellow fever. Collecting this dataset is motivated by the need to assist applications which utilise mosquito acoustics to conduct surveys to help predict outbreaks and inform intervention policy. The task of detecting mosquitoes from the sound of their wingbeats is challenging due to the difficulty in collecting recordings from realistic scenarios. To address this, as part of the HumBug project, we conducted global experiments to record mosquitoes ranging from those bred in culture cages to mosquitoes captured in the wild. Consequently, the audio recordings vary in signal-to-noise ratio and contain a broad range of indoor and outdoor background environments from Tanzania, Thailand, Kenya, the USA and the UK. In this paper we describe in detail how we collected, labelled and curated the data. The data is provided from a PostgreSQL database, which contains important metadata such as the capture method, age, feeding status and gender of the mosquitoes. Additionally, we provide code to extract features and train Bayesian convolutional neural networks for two key tasks: the identification of mosquitoes from their corresponding background environments, and the classification of detected mosquitoes into species. Our extensive dataset is both challenging to machine learning researchers focusing on acoustic identification, and critical to entomologists, geo-spatial modellers and other domain experts to understand mosquito behaviour, model their distribution, and manage the threat they pose to humans.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Engineering Sketch Generation for Computer-Aided Design
Authors:
Karl D. D. Willis,
Pradeep Kumar Jayaraman,
Joseph G. Lambourne,
Hang Chu,
Yewen Pu
Abstract:
Engineering sketches form the 2D basis of parametric Computer-Aided Design (CAD), the foremost modeling paradigm for manufactured objects. In this paper we tackle the problem of learning based engineering sketch generation as a first step towards synthesis and composition of parametric CAD models. We propose two generative models, CurveGen and TurtleGen, for engineering sketch generation. Both mod…
▽ More
Engineering sketches form the 2D basis of parametric Computer-Aided Design (CAD), the foremost modeling paradigm for manufactured objects. In this paper we tackle the problem of learning based engineering sketch generation as a first step towards synthesis and composition of parametric CAD models. We propose two generative models, CurveGen and TurtleGen, for engineering sketch generation. Both models generate curve primitives without the need for a sketch constraint solver and explicitly consider topology for downstream use with constraints and 3D CAD modeling operations. We find in our perceptual evaluation using human subjects that both CurveGen and TurtleGen produce more realistic engineering sketches when compared with the current state-of-the-art for engineering sketch generation.
△ Less
Submitted 19 April, 2021;
originally announced April 2021.
-
Inferring CAD Modeling Sequences Using Zone Graphs
Authors:
Xianghao Xu,
Wenzhe Peng,
Chin-Yi Cheng,
Karl D. D. Willis,
Daniel Ritchie
Abstract:
In computer-aided design (CAD), the ability to "reverse engineer" the modeling steps used to create 3D shapes is a long-sought-after goal. This process can be decomposed into two sub-problems: converting an input mesh or point cloud into a boundary representation (or B-rep), and then inferring modeling operations which construct this B-rep. In this paper, we present a new system for solving the se…
▽ More
In computer-aided design (CAD), the ability to "reverse engineer" the modeling steps used to create 3D shapes is a long-sought-after goal. This process can be decomposed into two sub-problems: converting an input mesh or point cloud into a boundary representation (or B-rep), and then inferring modeling operations which construct this B-rep. In this paper, we present a new system for solving the second sub-problem. Central to our approach is a new geometric representation: the zone graph. Zones are the set of solid regions formed by extending all B-Rep faces and partitioning space with them; a zone graph has these zones as its nodes, with edges denoting geometric adjacencies between them. Zone graphs allow us to tractably work with industry-standard CAD operations, unlike prior work using CSG with parametric primitives. We focus on CAD programs consisting of sketch + extrude + Boolean operations, which are common in CAD practice. We phrase our problem as search in the space of such extrusions permitted by the zone graph, and we train a graph neural network to score potential extrusions in order to accelerate the search. We show that our approach outperforms an existing CSG inference baseline in terms of geometric reconstruction accuracy and reconstruction time, while also creating more plausible modeling sequences.
△ Less
Submitted 20 April, 2021; v1 submitted 30 March, 2021;
originally announced April 2021.
-
Post-Hoc Domain Adaptation via Guided Data Homogenization
Authors:
Kurt Willis,
Luis Oala
Abstract:
Addressing shifts in data distributions is an important prerequisite for the deployment of deep learning models to real-world settings. A general approach to this problem involves the adjustment of models to a new domain through transfer learning. However, in many cases, this is not applicable in a post-hoc manner to deployed models and further parameter adjustments jeopardize safety certification…
▽ More
Addressing shifts in data distributions is an important prerequisite for the deployment of deep learning models to real-world settings. A general approach to this problem involves the adjustment of models to a new domain through transfer learning. However, in many cases, this is not applicable in a post-hoc manner to deployed models and further parameter adjustments jeopardize safety certifications that were established beforehand. In such a context, we propose to deal with changes in the data distribution via guided data homogenization which shifts the burden of adaptation from the model to the data. This approach makes use of information about the training data contained implicitly in the deep learning model to learn a domain transfer function. This allows for a targeted deployment of models to unknown scenarios without changing the model itself. We demonstrate the potential of data homogenization through experiments on the CIFAR-10 and MNIST data sets.
△ Less
Submitted 8 April, 2021;
originally announced April 2021.
-
BRepNet: A topological message passing system for solid models
Authors:
Joseph G. Lambourne,
Karl D. D. Willis,
Pradeep Kumar Jayaraman,
Aditya Sanghi,
Peter Meltzer,
Hooman Shayani
Abstract:
Boundary representation (B-rep) models are the standard way 3D shapes are described in Computer-Aided Design (CAD) applications. They combine lightweight parametric curves and surfaces with topological information which connects the geometric entities to describe manifolds. In this paper we introduce BRepNet, a neural network architecture designed to operate directly on B-rep data structures, avoi…
▽ More
Boundary representation (B-rep) models are the standard way 3D shapes are described in Computer-Aided Design (CAD) applications. They combine lightweight parametric curves and surfaces with topological information which connects the geometric entities to describe manifolds. In this paper we introduce BRepNet, a neural network architecture designed to operate directly on B-rep data structures, avoiding the need to approximate the model as meshes or point clouds. BRepNet defines convolutional kernels with respect to oriented coedges in the data structure. In the neighborhood of each coedge, a small collection of faces, edges and coedges can be identified and patterns in the feature vectors from these entities detected by specific learnable parameters. In addition, to encourage further deep learning research with B-reps, we publish the Fusion 360 Gallery segmentation dataset. A collection of over 35,000 B-rep models annotated with information about the modeling operations which created each face. We demonstrate that BRepNet can segment these models with higher accuracy than methods working on meshes, and point clouds.
△ Less
Submitted 8 April, 2021; v1 submitted 1 April, 2021;
originally announced April 2021.
-
Fusion 360 Gallery: A Dataset and Environment for Programmatic CAD Construction from Human Design Sequences
Authors:
Karl D. D. Willis,
Yewen Pu,
Jieliang Luo,
Hang Chu,
Tao Du,
Joseph G. Lambourne,
Armando Solar-Lezama,
Wojciech Matusik
Abstract:
Parametric computer-aided design (CAD) is a standard paradigm used to design manufactured objects, where a 3D shape is represented as a program supported by the CAD software. Despite the pervasiveness of parametric CAD and a growing interest from the research community, currently there does not exist a dataset of realistic CAD models in a concise programmatic form. In this paper we present the Fus…
▽ More
Parametric computer-aided design (CAD) is a standard paradigm used to design manufactured objects, where a 3D shape is represented as a program supported by the CAD software. Despite the pervasiveness of parametric CAD and a growing interest from the research community, currently there does not exist a dataset of realistic CAD models in a concise programmatic form. In this paper we present the Fusion 360 Gallery, consisting of a simple language with just the sketch and extrude modeling operations, and a dataset of 8,625 human design sequences expressed in this language. We also present an interactive environment called the Fusion 360 Gym, which exposes the sequential construction of a CAD program as a Markov decision process, making it amendable to machine learning approaches. As a use case for our dataset and environment, we define the CAD reconstruction task of recovering a CAD program from a target geometry. We report results of applying state-of-the-art methods of program synthesis with neurally guided search on this task.
△ Less
Submitted 16 May, 2021; v1 submitted 5 October, 2020;
originally announced October 2020.
-
Quantifying uncertainty in spatio-temporal changes of upper-ocean heat content estimates: an internationally coordinated comparison
Authors:
Abhishek Savita,
Catia M. Domingues,
Tim Boyer,
Viktor Gouretski,
Masayoshi Ishii,
Gregory C. Johnson,
John M. Lyman,
Josh K. Willis,
Simon J. Marsland,
William Hobbs,
John A. Church,
Didier P. Monselesan,
Peter Dobrohotoff,
Rebecca Cowley,
Susan E. Wijffels
Abstract:
The Earth system is accumulating energy due to human-induced activities. More than 90 percent of this energy has been stored in the ocean as heat since 1970, with about 64 percent of that in the upper 700 m. Differences in upper ocean heat content anomaly (OHCA) estimates, however, exist. Here, we evaluate spread in upper OHCA estimates arising from choices in instrumental bias corrections and map…
▽ More
The Earth system is accumulating energy due to human-induced activities. More than 90 percent of this energy has been stored in the ocean as heat since 1970, with about 64 percent of that in the upper 700 m. Differences in upper ocean heat content anomaly (OHCA) estimates, however, exist. Here, we evaluate spread in upper OHCA estimates arising from choices in instrumental bias corrections and map** methods, in addition to the effect of using a common ocean mask. The same dataset was mapped by six research groups for 1970 to 2008, with six instrumental bias corrections applied to expendable bathythermograph (XBT) data. We find that use of a common ocean mask may impact estimation of global OHCA by +- 13 percent. Uncertainty due to map** method dominates over XBT bias correction at a global scale and is largest in the Indian Ocean and in the eddy-rich and frontal regions of all basins. Uncertainty due to XBT bias correction is largest in the Pacific Ocean within 30N to 30S. In both map** and XBT cases, spread is higher since the 1990s. Important differences in spatial trends among map** methods are found in the well-observed Northwest Atlantic and the poorly-observed Southern Ocean. Although our results cannot identify the best map** or bias correction schemes, they identify where and when greater uncertainties exist, and so where further refinements may yield the largest improvements. Our results highlight the need for a future international coordination to evaluate performance of existing map** methods.
△ Less
Submitted 7 September, 2020;
originally announced September 2020.
-
UV-Net: Learning from Boundary Representations
Authors:
Pradeep Kumar Jayaraman,
Aditya Sanghi,
Joseph G. Lambourne,
Karl D. D. Willis,
Thomas Davies,
Hooman Shayani,
Nigel Morris
Abstract:
We introduce UV-Net, a novel neural network architecture and representation designed to operate directly on Boundary representation (B-rep) data from 3D CAD models. The B-rep format is widely used in the design, simulation and manufacturing industries to enable sophisticated and precise CAD modeling operations. However, B-rep data presents some unique challenges when used with modern machine learn…
▽ More
We introduce UV-Net, a novel neural network architecture and representation designed to operate directly on Boundary representation (B-rep) data from 3D CAD models. The B-rep format is widely used in the design, simulation and manufacturing industries to enable sophisticated and precise CAD modeling operations. However, B-rep data presents some unique challenges when used with modern machine learning due to the complexity of the data structure and its support for both continuous non-Euclidean geometric entities and discrete topological entities. In this paper, we propose a unified representation for B-rep data that exploits the U and V parameter domain of curves and surfaces to model geometry, and an adjacency graph to explicitly model topology. This leads to a unique and efficient network architecture, UV-Net, that couples image and graph convolutional neural networks in a compute and memory-efficient manner. To aid in future research we present a synthetic labelled B-rep dataset, SolidLetters, derived from human designed fonts with variations in both geometry and topology. Finally we demonstrate that UV-Net can generalize to supervised and unsupervised tasks on five datasets, while outperforming alternate 3D shape representations such as point clouds, voxels, and meshes.
△ Less
Submitted 25 April, 2021; v1 submitted 17 June, 2020;
originally announced June 2020.
-
Conservation or deterioration in heritage sites? Estimating willingness to pay for preservation
Authors:
Ali Ardeshiri,
Roya Etminani Ghasrodashti,
Taha Hossein Rashidi,
Mahyar Ardeshiri,
Ken Willis
Abstract:
A significant part of the United Nations World Heritage Sites (WHSs) is located in develo** countries. These sites attract an increasing number of tourist and income to these countries. Unfortunately, many of these WHSs are in a poor condition due to climatic and environmental impacts; war and tourism pressure, requiring the urgent need for restoration and preservation (Tuan & Navrud, 2007). In…
▽ More
A significant part of the United Nations World Heritage Sites (WHSs) is located in develo** countries. These sites attract an increasing number of tourist and income to these countries. Unfortunately, many of these WHSs are in a poor condition due to climatic and environmental impacts; war and tourism pressure, requiring the urgent need for restoration and preservation (Tuan & Navrud, 2007). In this study, we characterise residents from Shiraz city (visitors and non-visitors) willingness to invest in the management of the heritage sites through models for the preservation of heritage and development of tourism as a local resource. The research looks at different categories of heritage sites within Shiraz city, Iran. The measurement instrument is a stated preference referendum task administered state-wide to a sample of 489 respondents, with the payment mechanism defined as a purpose-specific incremental levy of a fixed amount over a set period of years. A Latent Class Binary Logit model, using parametric constraints is used innovatively to deal with any strategic voting such as Yea-sayers and Nay-sayers, as well as revealing the latent heterogeneity among sample members. Results indicate that almost 14% of the sampled population is unwilling to be levied any amount (Nay-sayers) to preserve any heritage sites. Not recognizing the presence of nay-sayers in the data or recognizing them but eliminating them from the estimation will result in biased Willingness to Pay (WTP) results and, consequently, biased policy propositions by authorities. Moreover, it is found that the type of heritage site is a driver of WTP. The results from this study provide insights into the WTP of heritage site visitors and non-visitors with respect to avoiding the impacts of future erosion and destruction and contributing to heritage management and maintenance policies.
△ Less
Submitted 6 February, 2019;
originally announced February 2019.
-
BCCNet: Bayesian classifier combination neural network
Authors:
Olga Isupova,
Yunpeng Li,
Danil Kuzin,
Stephen J Roberts,
Katherine Willis,
Steven Reece
Abstract:
Machine learning research for develo** countries can demonstrate clear sustainable impact by delivering actionable and timely information to in-country government organisations (GOs) and NGOs in response to their critical information requirements. We co-create products with UK and in-country commercial, GO and NGO partners to ensure the machine learning algorithms address appropriate user needs…
▽ More
Machine learning research for develo** countries can demonstrate clear sustainable impact by delivering actionable and timely information to in-country government organisations (GOs) and NGOs in response to their critical information requirements. We co-create products with UK and in-country commercial, GO and NGO partners to ensure the machine learning algorithms address appropriate user needs whether for tactical decision making or evidence-based policy decisions. In one particular case, we developed and deployed a novel algorithm, BCCNet, to quickly process large quantities of unstructured data to prevent and respond to natural disasters. Crowdsourcing provides an efficient mechanism to generate labels from unstructured data to prime machine learning algorithms for large scale data analysis. However, these labels are often imperfect with qualities varying among different citizen scientists, which prohibits their direct use with many state-of-the-art machine learning techniques. We describe BCCNet, a framework that simultaneously aggregates biased and contradictory labels from the crowd and trains an automatic classifier to process new data. Our case studies, mosquito sound detection for malaria prevention and damage detection for disaster response, show the efficacy of our method in the challenging context of develo** world applications.
△ Less
Submitted 29 November, 2018;
originally announced November 2018.
-
Automated bird sound recognition in realistic settings
Authors:
Timos Papadopoulos,
Stephen J. Roberts,
Katherine J. Willis
Abstract:
We evaluated the effectiveness of an automated bird sound identification system in a situation that emulates a realistic, typical application. We trained classification algorithms on a crowd-sourced collection of bird audio recording data and restricted our training methods to be completely free of manual intervention. The approach is hence directly applicable to the analysis of multiple species c…
▽ More
We evaluated the effectiveness of an automated bird sound identification system in a situation that emulates a realistic, typical application. We trained classification algorithms on a crowd-sourced collection of bird audio recording data and restricted our training methods to be completely free of manual intervention. The approach is hence directly applicable to the analysis of multiple species collections, with labelling provided by crowd-sourced collection. We evaluated the performance of the bird sound recognition system on a realistic number of candidate classes, corresponding to real conditions. We investigated the use of two canonical classification methods, chosen due to their widespread use and ease of interpretation, namely a k Nearest Neighbour (kNN) classifier with histogram-based features and a Support Vector Machine (SVM) with time-summarisation features. We further investigated the use of a certainty measure, derived from the output probabilities of the classifiers, to enhance the interpretability and reliability of the class decisions. Our results demonstrate that both identification methods achieved similar performance, but we argue that the use of the kNN classifier offers somewhat more flexibility. Furthermore, we show that employing an outcome certainty measure provides a valuable and consistent indicator of the reliability of classification results. Our use of generic training data and our investigation of probabilistic classification methodologies that can flexibly address the variable number of candidate species/classes that are expected to be encountered in the field, directly contribute to the development of a practical bird sound identification system with potentially global application. Further, we show that certainty measures associated with identification outcomes can significantly contribute to the practical usability of the overall system.
△ Less
Submitted 4 September, 2018;
originally announced September 2018.
-
Chemo-kinematics of the Milky Way from the SDSS-III MARVELS Survey
Authors:
Nolan Grieves,
Jian Ge,
Neil Thomas,
Kevin Willis,
Bo Ma,
Diego Lorenzo-Oliveira,
A. B. A. Queiroz,
Luan Ghezzi,
Cristina Chiappini,
Friedrich Anders,
Letícia Dutra-Ferreira,
Gustavo F. Porto de Mello,
Basílio X. Santiago,
Luiz N. da Costa,
Ricardo L. C. Ogando,
E. F. del Peloso,
Jonathan C. Tan,
Donald P. Schneider,
Joshua Pepper,
Keivan G. Stassun,
Bo Zhao,
Dmitry Bizyaev,
Kaike Pan
Abstract:
Combining stellar atmospheric parameters, such as effective temperature, surface gravity, and metallicity, with barycentric radial velocity data provides insight into the chemo-dynamics of the Milky Way and our local Galactic environment. We analyze 3075 stars with spectroscopic data from the Sloan Digital Sky Survey III (SDSS-III) MARVELS radial velocity survey and present atmospheric parameters…
▽ More
Combining stellar atmospheric parameters, such as effective temperature, surface gravity, and metallicity, with barycentric radial velocity data provides insight into the chemo-dynamics of the Milky Way and our local Galactic environment. We analyze 3075 stars with spectroscopic data from the Sloan Digital Sky Survey III (SDSS-III) MARVELS radial velocity survey and present atmospheric parameters for 2343 dwarf stars using the spectral indices method, a modified version of the equivalent width method. We present barycentric radial velocities for a sample of 2610 stars with a median uncertainty of 0.3 km s$^{-1}$. We determine stellar ages using two independent methods and calculate ages for 2335 stars with a maximum-likelihood isochronal age-dating method and for 2194 stars with a Bayesian age-dating method. Using previously published parallax data we compute Galactic orbits and space velocities for 2504 stars to explore stellar populations based on kinematic and age parameters. This study combines good ages and exquisite velocities to explore local chemo-kinematics of the Milky Way, which complements many of the recent studies of giant stars with the APOGEE survey, and we find our results to be in agreement with current chemo-dynamical models of the Milky Way. Particularly, we find from our metallicity distributions and velocity-age relations of a kinematically-defined thin disk that the metal rich end has stars of all ages, even after we clean the sample of highly eccentric stars, suggesting that radial migration plays a key role in the metallicity scatter of the thin disk. All stellar parameters and kinematic data derived in this work are catalogued and published online in machine-readable form.
△ Less
Submitted 5 October, 2020; v1 submitted 30 March, 2018;
originally announced March 2018.
-
Cost-sensitive detection with variational autoencoders for environmental acoustic sensing
Authors:
Yunpeng Li,
Ivan Kiskin,
Davide Zilli,
Marianne Sinka,
Henry Chan,
Kathy Willis,
Stephen Roberts
Abstract:
Environmental acoustic sensing involves the retrieval and processing of audio signals to better understand our surroundings. While large-scale acoustic data make manual analysis infeasible, they provide a suitable playground for machine learning approaches. Most existing machine learning techniques developed for environmental acoustic sensing do not provide flexible control of the trade-off betwee…
▽ More
Environmental acoustic sensing involves the retrieval and processing of audio signals to better understand our surroundings. While large-scale acoustic data make manual analysis infeasible, they provide a suitable playground for machine learning approaches. Most existing machine learning techniques developed for environmental acoustic sensing do not provide flexible control of the trade-off between the false positive rate and the false negative rate. This paper presents a cost-sensitive classification paradigm, in which the hyper-parameters of classifiers and the structure of variational autoencoders are selected in a principled Neyman-Pearson framework. We examine the performance of the proposed approach using a dataset from the HumBug project which aims to detect the presence of mosquitoes using sound collected by simple embedded devices.
△ Less
Submitted 6 December, 2017;
originally announced December 2017.
-
Mosquito detection with low-cost smartphones: data acquisition for malaria research
Authors:
Yunpeng Li,
Davide Zilli,
Henry Chan,
Ivan Kiskin,
Marianne Sinka,
Stephen Roberts,
Kathy Willis
Abstract:
Mosquitoes are a major vector for malaria, causing hundreds of thousands of deaths in the develo** world each year. Not only is the prevention of mosquito bites of paramount importance to the reduction of malaria transmission cases, but understanding in more forensic detail the interplay between malaria, mosquito vectors, vegetation, standing water and human populations is crucial to the deploym…
▽ More
Mosquitoes are a major vector for malaria, causing hundreds of thousands of deaths in the develo** world each year. Not only is the prevention of mosquito bites of paramount importance to the reduction of malaria transmission cases, but understanding in more forensic detail the interplay between malaria, mosquito vectors, vegetation, standing water and human populations is crucial to the deployment of more effective interventions. Typically the presence and detection of malaria-vectoring mosquitoes is only quantified by hand-operated insect traps or signified by the diagnosis of malaria. If we are to gather timely, large-scale data to improve this situation, we need to automate the process of mosquito detection and classification as much as possible. In this paper, we present a candidate mobile sensing system that acts as both a portable early warning device and an automatic acoustic data acquisition pipeline to help fuel scientific inquiry and policy. The machine learning algorithm that powers the mobile system achieves excellent off-line multi-species detection performance while remaining computationally efficient. Further, we have conducted preliminary live mosquito detection tests using low-cost mobile phones and achieved promising results. The deployment of this system for field usage in Southeast Asia and Africa is planned in the near future. In order to accelerate processing of field recordings and labelling of collected data, we employ a citizen science platform in conjunction with automated methods, the former implemented using the Zooniverse platform, allowing crowdsourcing on a grand scale.
△ Less
Submitted 5 December, 2017; v1 submitted 16 November, 2017;
originally announced November 2017.
-
Mosquito Detection with Neural Networks: The Buzz of Deep Learning
Authors:
Ivan Kiskin,
Bernardo Pérez Orozco,
Theo Windebank,
Davide Zilli,
Marianne Sinka,
Kathy Willis,
Stephen Roberts
Abstract:
Many real-world time-series analysis problems are characterised by scarce data. Solutions typically rely on hand-crafted features extracted from the time or frequency domain allied with classification or regression engines which condition on this (often low-dimensional) feature vector. The huge advances enjoyed by many application domains in recent years have been fuelled by the use of deep learni…
▽ More
Many real-world time-series analysis problems are characterised by scarce data. Solutions typically rely on hand-crafted features extracted from the time or frequency domain allied with classification or regression engines which condition on this (often low-dimensional) feature vector. The huge advances enjoyed by many application domains in recent years have been fuelled by the use of deep learning architectures trained on large data sets. This paper presents an application of deep learning for acoustic event detection in a challenging, data-scarce, real-world problem. Our candidate challenge is to accurately detect the presence of a mosquito from its acoustic signature. We develop convolutional neural networks (CNNs) operating on wavelet transformations of audio recordings. Furthermore, we interrogate the network's predictive power by visualising statistics of network-excitatory samples. These visualisations offer a deep insight into the relative informativeness of components in the detection problem. We include comparisons with conventional classifiers, conditioned on both hand-tuned and generic features, to stress the strength of automatic deep feature learning. Detection is achieved with performance metrics significantly surpassing those of existing algorithmic methods, as well as marginally exceeding those attained by individual human experts.
△ Less
Submitted 15 May, 2017;
originally announced May 2017.
-
Exploring the Brown Dwarf Desert: New Substellar Companions from the SDSS-III MARVELS Survey
Authors:
Nolan Grieves,
Jian Ge,
Neil Thomas,
Bo Ma,
Sirinrat Sithajan,
Luan Ghezzi,
Ben Kimock,
Kevin Willis,
Nathan De Lee,
Brian Lee,
Scott W. Fleming,
Eric Agol,
Nicholas Troup,
Martin Paegert,
Donald P. Schneider,
Keivan Stassun,
Frank Varosi,
Bo Zhao,
Jian Liu,
Rui Li,
Gustavo F. Porto de Mello,
Dmitry Bizyaev,
Kaike Pan,
Leticia Dutra-Ferreira,
Diego Lorenzo-Oliveira
, et al. (5 additional authors not shown)
Abstract:
Planet searches using the radial velocity technique show a paucity of companions to solar-type stars within ~5 AU in the mass range of ~10 - 80 M$_{\text{Jup}}$. This deficit, known as the brown dwarf desert, currently has no conclusive explanation. New substellar companions in this region help asses the reality of the desert and provide insight to the formation and evolution of these objects. Her…
▽ More
Planet searches using the radial velocity technique show a paucity of companions to solar-type stars within ~5 AU in the mass range of ~10 - 80 M$_{\text{Jup}}$. This deficit, known as the brown dwarf desert, currently has no conclusive explanation. New substellar companions in this region help asses the reality of the desert and provide insight to the formation and evolution of these objects. Here we present 10 new brown dwarf and two low-mass stellar companion candidates around solar-type stars from the Multi-object APO Radial-Velocity Exoplanet Large-Area Survey (MARVELS) of the Sloan Digital Sky Survey III (SDSS-III). These companions were selected from processed MARVELS data using the latest University of Florida Two Dimensional (UF2D) pipeline, which shows significant improvement and reduction of systematic errors over previous pipelines. The 10 brown dwarf companions range in mass from ~13 to 76 M$_{\text{Jup}}$ and have orbital radii of less than 1 AU. The two stellar companions have minimum masses of ~98 and 100 M$_{\text{Jup}}$. The host stars of the MARVELS brown dwarf sample have a mean metallicity of [Fe/H] = 0.03 $\pm$ 0.08 dex. Given our stellar sample we estimate the brown dwarf occurrence rate around solar-type stars with periods less than ~300 days to be ~0.56%.
△ Less
Submitted 6 February, 2017;
originally announced February 2017.
-
Combinatorial Analysis of a Subtraction Game on Graphs
Authors:
Richard Adams,
Janae Dixon,
Jennifer Elder,
Jamie Peabody,
Oscar Vega,
Karen Willis
Abstract:
We define a two-player combinatorial game in which players take alternate turns; each turn consists on deleting a vertex of a graph, together with all the edges containing such vertex. If any vertex became isolated by a player's move then it would also be deleted. A player wins the game when the other player has no moves available.
We study this game under various viewpoints: by finding specific…
▽ More
We define a two-player combinatorial game in which players take alternate turns; each turn consists on deleting a vertex of a graph, together with all the edges containing such vertex. If any vertex became isolated by a player's move then it would also be deleted. A player wins the game when the other player has no moves available.
We study this game under various viewpoints: by finding specific strategies for certain families of graphs, through using properties of a graph's automorphism group, by writing a program to look at Sprague-Grundy numbers, and by studying the game when played on random graphs.
When analyzing Grim played on paths, using the Sprague-Grundy function, we find a connection to a standing open question about Octal games.
△ Less
Submitted 1 August, 2016; v1 submitted 20 July, 2015;
originally announced July 2015.
-
Detecting bird sound in unknown acoustic background using crowdsourced training data
Authors:
Timos Papadopoulos,
Stephen Roberts,
Kathy Willis
Abstract:
Biodiversity monitoring using audio recordings is achievable at a truly global scale via large-scale deployment of inexpensive, unattended recording stations or by large-scale crowdsourcing using recording and species recognition on mobile devices. The ability, however, to reliably identify vocalising animal species is limited by the fact that acoustic signatures of interest in such recordings are…
▽ More
Biodiversity monitoring using audio recordings is achievable at a truly global scale via large-scale deployment of inexpensive, unattended recording stations or by large-scale crowdsourcing using recording and species recognition on mobile devices. The ability, however, to reliably identify vocalising animal species is limited by the fact that acoustic signatures of interest in such recordings are typically embedded in a diverse and complex acoustic background. To avoid the problems associated with modelling such backgrounds, we build generative models of bird sounds and use the concept of novelty detection to screen recordings to detect sections of data which are likely bird vocalisations. We present detection results against various acoustic environments and different signal-to-noise ratios. We discuss the issues related to selecting the cost function and setting detection thresholds in such algorithms. Our methods are designed to be scalable and automatically applicable to arbitrary selections of species depending on the specific geographic region and time period of deployment.
△ Less
Submitted 24 May, 2015;
originally announced May 2015.
-
EMC/FDTD/MD simulation of carrier transport and electrodynamics in two-dimensional electron systems
Authors:
N. Sule,
K. J. Willis,
S. C. Hagness,
I. Knezevic
Abstract:
We present the implementation and application of a multiphysics simulation technique to carrier dynamics under electromagnetic excitation in supported two-dimensional electronic systems. The technique combines ensemble Monte Carlo (EMC) for carrier transport with finite-difference time-domain (FDTD) for electrodynamics and molecular dynamics (MD) for short-range Coulomb interactions among particle…
▽ More
We present the implementation and application of a multiphysics simulation technique to carrier dynamics under electromagnetic excitation in supported two-dimensional electronic systems. The technique combines ensemble Monte Carlo (EMC) for carrier transport with finite-difference time-domain (FDTD) for electrodynamics and molecular dynamics (MD) for short-range Coulomb interactions among particles. We demonstrate the use of this EMC/FDTD/MD technique by calculating the room-temperature dc and ac conductivity of graphene supported on SiO2.
△ Less
Submitted 20 May, 2014;
originally announced May 2014.
-
Resolving the Shocks in Radio Galaxy Nebulae: HST and Radio Imaging of 3C 171, 3C277.3 and PKS2250-41
Authors:
Avanti Tilak,
Christopher P. O'Dea,
Clive Tadhunter,
Karen Willis,
Rafaella Morganti,
Stefi A. Baum,
Anton Koekemoer,
Daniele Dallacasa
Abstract:
We present the results of HST/WFPC2 medium and narrow band imaging and VLA and MERLIN2 radio imaging of three powerful radio galaxies: 3C 171, 3C 277.3, and PKS 2250-41. We obtained images of the rest frame [OIII]$λ$5007 and [OII]$λ$3727 line emission using the Linear Ramp Filters on WFPC2. The correlations between the emission line morphology and the [OIII]/[OII] line ratios with the radio emis…
▽ More
We present the results of HST/WFPC2 medium and narrow band imaging and VLA and MERLIN2 radio imaging of three powerful radio galaxies: 3C 171, 3C 277.3, and PKS 2250-41. We obtained images of the rest frame [OIII]$λ$5007 and [OII]$λ$3727 line emission using the Linear Ramp Filters on WFPC2. The correlations between the emission line morphology and the [OIII]/[OII] line ratios with the radio emission seen in ground based observations are clarified by the HST imaging. We confirm that the radio lobes and hot-spots are preferentially associated with lower ionization gas. 3C 171 exhibits high surface brightness emission line gas mainly along the radio source axis. The lowest ionization gas is seen at the Eastern hot spot. In 3C 277.3 there is bright high ionization gas (and continuum) offset just to the east of the radio knot K1. Our observations are consistent with previous work suggesting that this emission is produced by precursor gas ionized by the shock being driven into the cloud by the deflected radio jet. In PKS 2250-41 we resolve the emission line arc which wraps around the outer rim of the western lobe. The lower ionization [OII] emission is nested just interior to the higher ionization [OIII] emission suggesting that we have resolved the cooling region behind the bow shock. We also detect possible continuum emission from the secondary hot-spot. Thus, our observations support the hypothesis that in these sources, the interaction between the expanding radio source and the ambient gas strongly influences the morphology, kinematics, and ionization of the gas.
△ Less
Submitted 27 July, 2005;
originally announced July 2005.
-
The Anglo-Australian Observatory's 2dF Facility
Authors:
I. J. Lewis,
R. D. Cannon,
K. Taylor,
K. Glazebrook,
J. A. Bailey,
I. K. Baldry,
J. R. Barton,
T. J. Bridges,
G. B. Dalton,
T. J. Farrell,
P. M. Gray,
A. Lankshear,
C. McCowage,
I. R. Parry,
R. M. Sharples,
K. Shortridge,
G. A. Smith,
J. Stevenson,
J. O. Straede,
L. G. Waller,
J. D. Whittard,
J. K. Wilcox,
K. C. Willis
Abstract:
The 2dF (Two-degree Field) facility at the prime focus of the Anglo-Australian Telescope provides multiple object spectroscopy over a 2 degree field of view. Up to 400 target fibres can be independently positioned by a complex robot. Two spectrographs provide spectra with resolutions of between 500 and 2000, over wavelength ranges of 440nm and 110nm respectively. The 2dF facility began routine o…
▽ More
The 2dF (Two-degree Field) facility at the prime focus of the Anglo-Australian Telescope provides multiple object spectroscopy over a 2 degree field of view. Up to 400 target fibres can be independently positioned by a complex robot. Two spectrographs provide spectra with resolutions of between 500 and 2000, over wavelength ranges of 440nm and 110nm respectively. The 2dF facility began routine observations in 1997.
2dF was designed primarily for galaxy redshift surveys and has a number of innovative features. The large corrector lens incorporates an atmospheric dispersion compensator, essential for wide wavelength coverage with small diameter fibres. The instrument has two full sets of fibres on separate field plates, so that re-configuring can be done in parallel with observing. The robot positioner places one fibre every 6 seconds, to a precision of 0.3 arcsec (20micron) over the full field. All components of 2dF, including the spectrographs, are mounted on a 5-m diameter telescope top-end ring for ease of handling and to keep the optical fibres short in order to maximise UV throughput . There is a pipeline data reduction system which allows each data set to be fully analysed while the next field is being observed.
In this paper we provide the historical background to the 2dF facility, the design philosophy, a full technical description and a summary of the performance of the instrument. We also briefly review its scientific applications and possible future developments.
△ Less
Submitted 8 February, 2002;
originally announced February 2002.