-
FPGA Digital Dice using Pseudo Random Number Generator
Authors:
Michael Lim Kee Hian,
Ten Wei Lin,
Zachary Wu Xuan,
Stephanie-Ann Loy,
Maoyang Xiang,
T. Hui Teo
Abstract:
The goal of this project is to design a digital dice that displays dice numbers in real-time. The number is generated by a pseudo-random number generator (PRNG) using XORshift algorithm that is implemented in Verilog HDL on an FPGA. The digital dice is equipped with tilt sensor, display, power management circuit, and rechargeable battery hosted in a 3D printed dice casing. By shaking the digital d…
▽ More
The goal of this project is to design a digital dice that displays dice numbers in real-time. The number is generated by a pseudo-random number generator (PRNG) using XORshift algorithm that is implemented in Verilog HDL on an FPGA. The digital dice is equipped with tilt sensor, display, power management circuit, and rechargeable battery hosted in a 3D printed dice casing. By shaking the digital dice, the tilt sensor signal produces a seed for the PRNG. This digital dice demonstrates a set of possible random numbers of 2, 4, 6, 8, 10, 12, 20, 100 that simulate the number of dice sides. The kit is named SUTDicey.
△ Less
Submitted 1 May, 2024;
originally announced May 2024.
-
On the Uniqueness of Solution for the Bellman Equation of LTL Objectives
Authors:
Zetong Xuan,
Alper Kamil Bozkurt,
Miroslav Pajic,
Yu Wang
Abstract:
Surrogate rewards for linear temporal logic (LTL) objectives are commonly utilized in planning problems for LTL objectives. In a widely-adopted surrogate reward approach, two discount factors are used to ensure that the expected return approximates the satisfaction probability of the LTL objective. The expected return then can be estimated by methods using the Bellman updates such as reinforcement…
▽ More
Surrogate rewards for linear temporal logic (LTL) objectives are commonly utilized in planning problems for LTL objectives. In a widely-adopted surrogate reward approach, two discount factors are used to ensure that the expected return approximates the satisfaction probability of the LTL objective. The expected return then can be estimated by methods using the Bellman updates such as reinforcement learning. However, the uniqueness of the solution to the Bellman equation with two discount factors has not been explicitly discussed. We demonstrate with an example that when one of the discount factors is set to one, as allowed in many previous works, the Bellman equation may have multiple solutions, leading to inaccurate evaluation of the expected return. We then propose a condition for the Bellman equation to have the expected return as the unique solution, requiring the solutions for states inside a rejecting bottom strongly connected component (BSCC) to be 0. We prove this condition is sufficient by showing that the solutions for the states with discounting can be separated from those for the states without discounting under this condition
△ Less
Submitted 7 April, 2024;
originally announced April 2024.
-
Stochastic Gravitational Wave Background from Highly-Eccentric Stellar-Mass Binaries in the Milli-hertz Band
Authors:
Zeyuan Xuan,
Smadar Naoz,
Bence Kocsis,
Erez Michaely
Abstract:
Many gravitational wave (GW) sources are expected to have non-negligible eccentricity in the millihertz band. These highly eccentric compact object binaries may commonly serve as a progenitor stage of GW mergers, particularly in dynamical channels where environmental perturbations bring a binary with large initial orbital separation into a close pericenter passage, leading to efficient GW emission…
▽ More
Many gravitational wave (GW) sources are expected to have non-negligible eccentricity in the millihertz band. These highly eccentric compact object binaries may commonly serve as a progenitor stage of GW mergers, particularly in dynamical channels where environmental perturbations bring a binary with large initial orbital separation into a close pericenter passage, leading to efficient GW emission and a final merger. This work examines the stochastic GW background from highly eccentric ($e\gtrsim 0.9$), stellar-mass sources in the mHz band. Our findings suggest that these binaries can contribute a substantial GW power spectrum, potentially exceeding the LISA instrumental noise at $\sim 3-7$~mHz. This stochastic background is likely to be dominated by eccentric sources within the Milky Way, thus introducing anisotropy and time dependence in LISA's detection. However, given efficient search strategies to identify GW transients from highly eccentric binaries, the unresolvable noise level can be substantially lower, approaching $\sim 2$ orders of magnitude below the LISA noise curve. Therefore, we highlight the importance of characterizing stellar-mass GW sources with extreme eccentricity, especially their transient GW signals in the millihertz band.
△ Less
Submitted 12 June, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
UltrAvatar: A Realistic Animatable 3D Avatar Diffusion Model with Authenticity Guided Textures
Authors:
Mingyuan Zhou,
Rakib Hyder,
Ziwei Xuan,
Guojun Qi
Abstract:
Recent advances in 3D avatar generation have gained significant attentions. These breakthroughs aim to produce more realistic animatable avatars, narrowing the gap between virtual and real-world experiences. Most of existing works employ Score Distillation Sampling (SDS) loss, combined with a differentiable renderer and text condition, to guide a diffusion model in generating 3D avatars. However,…
▽ More
Recent advances in 3D avatar generation have gained significant attentions. These breakthroughs aim to produce more realistic animatable avatars, narrowing the gap between virtual and real-world experiences. Most of existing works employ Score Distillation Sampling (SDS) loss, combined with a differentiable renderer and text condition, to guide a diffusion model in generating 3D avatars. However, SDS often generates oversmoothed results with few facial details, thereby lacking the diversity compared with ancestral sampling. On the other hand, other works generate 3D avatar from a single image, where the challenges of unwanted lighting effects, perspective views, and inferior image quality make them difficult to reliably reconstruct the 3D face meshes with the aligned complete textures. In this paper, we propose a novel 3D avatar generation approach termed UltrAvatar with enhanced fidelity of geometry, and superior quality of physically based rendering (PBR) textures without unwanted lighting. To this end, the proposed approach presents a diffuse color extraction model and an authenticity guided texture diffusion model. The former removes the unwanted lighting effects to reveal true diffuse colors so that the generated avatars can be rendered under various lighting conditions. The latter follows two gradient-based guidances for generating PBR textures to render diverse face-identity features and details better aligning with 3D mesh geometry. We demonstrate the effectiveness and robustness of the proposed method, outperforming the state-of-the-art methods by a large margin in the experiments.
△ Less
Submitted 19 January, 2024;
originally announced January 2024.
-
AiDAC: A Low-Cost In-Memory Computing Architecture with All-Analog Multi-Bit Compute and Interconnect
Authors:
Zihao Xuan,
Song Chen,
Yi Kang
Abstract:
Analog in-memory computing (AiMC) is an emerging technology that shows fantastic performance superiority for neural network acceleration. However, as the computational bit-width and scale increase, high-precision data conversion and long-distance data routing will result in unacceptable energy and latency overheads in the AiMC system. In this work, we focus on the potential of in-charge computing…
▽ More
Analog in-memory computing (AiMC) is an emerging technology that shows fantastic performance superiority for neural network acceleration. However, as the computational bit-width and scale increase, high-precision data conversion and long-distance data routing will result in unacceptable energy and latency overheads in the AiMC system. In this work, we focus on the potential of in-charge computing and in-time interconnection and show an innovative AiMC architecture, named AiDAC, with three key contributions: (1) AiDAC enhances multibit computing efficiency and reduces data conversion times by grou** capacitors technology; (2) AiDAC first adopts row drivers and column time accumulators to achieve large-scale AiMC arrays integration while minimizing the energy cost of data movements. (3) AiDAC is the first work to support large-scale all-analog multibit vector-matrix multiplication (VMM) operations. The evaluation shows that AiDAC maintains high-precision calculation (less than 0.79% total computing error) while also possessing excellent performance features, such as high parallelism (up to 26.2TOPS), low latency (<20ns/VMM), and high energy efficiency (123.8TOPS/W), for 8bits VMM with 1024 input channels.
△ Less
Submitted 20 December, 2023; v1 submitted 18 December, 2023;
originally announced December 2023.
-
OmniMotionGPT: Animal Motion Generation with Limited Data
Authors:
Zhangsihao Yang,
Mingyuan Zhou,
Mengyi Shan,
Bingbing Wen,
Ziwei Xuan,
Mitch Hill,
Junjie Bai,
Guo-Jun Qi,
Yalin Wang
Abstract:
Our paper aims to generate diverse and realistic animal motion sequences from textual descriptions, without a large-scale animal text-motion dataset. While the task of text-driven human motion synthesis is already extensively studied and benchmarked, it remains challenging to transfer this success to other skeleton structures with limited data. In this work, we design a model architecture that imi…
▽ More
Our paper aims to generate diverse and realistic animal motion sequences from textual descriptions, without a large-scale animal text-motion dataset. While the task of text-driven human motion synthesis is already extensively studied and benchmarked, it remains challenging to transfer this success to other skeleton structures with limited data. In this work, we design a model architecture that imitates Generative Pretraining Transformer (GPT), utilizing prior knowledge learned from human data to the animal domain. We jointly train motion autoencoders for both animal and human motions and at the same time optimize through the similarity scores among human motion encoding, animal motion encoding, and text CLIP embedding. Presenting the first solution to this problem, we are able to generate animal motions with high diversity and fidelity, quantitatively and qualitatively outperforming the results of training human motion generation baselines on animal data. Additionally, we introduce AnimalML3D, the first text-animal motion dataset with 1240 animation sequences spanning 36 different animal identities. We hope this dataset would mediate the data scarcity problem in text-driven animal motion generation, providing a new playground for the research community.
△ Less
Submitted 30 November, 2023;
originally announced November 2023.
-
An analysis of the fragmentation of observing time at the Muztagh-ata site
Authors:
Gu Wen-bo,
Xu **g,
Feng Guo-jie,
Zhang Xuan,
Wang Le-tian,
Wang Xin-liang,
Ali Esamdin,
Shen li-xian
Abstract:
Cloud cover plays a pivotal role in assessing observational conditions for astronomical site-testing. Except for the fraction of observing time, its fragmentation also wields a significant influence on the quality of nighttime sky clarity. In this article, we introduce the function Gamma, designed to comprehensively capture both the fraction of available observing time and its continuity. Leveragi…
▽ More
Cloud cover plays a pivotal role in assessing observational conditions for astronomical site-testing. Except for the fraction of observing time, its fragmentation also wields a significant influence on the quality of nighttime sky clarity. In this article, we introduce the function Gamma, designed to comprehensively capture both the fraction of available observing time and its continuity. Leveraging in situ measurement data gathered at the Muztagh-ata site between 2017 and 2021, we showcase the effectiveness of our approach. The statistical result illustrates that the Muztagh-ata site affords approximately 122 nights of absolute clear and 205 very good nights annually, corresponding to Gamma greater than or equal 0.9 and Gamma greater than or equal 0.36 respectively.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Detecting Gravitational Wave Bursts From Stellar-Mass Binaries in the Milli-hertz Band
Authors:
Zeyuan Xuan,
Smadar Naoz,
Bence Kocsis,
Erez Michaely
Abstract:
The dynamical formation channels of gravitational wave (GW) sources typically involve a stage when the compact object binary source interacts with the environment, which may excite its eccentricity, yielding efficient GW emission. For the wide eccentric compact object binaries, the GW emission happens mostly near the pericenter passage, creating a unique, burst-like signature in the waveform. This…
▽ More
The dynamical formation channels of gravitational wave (GW) sources typically involve a stage when the compact object binary source interacts with the environment, which may excite its eccentricity, yielding efficient GW emission. For the wide eccentric compact object binaries, the GW emission happens mostly near the pericenter passage, creating a unique, burst-like signature in the waveform. This work examines the possibility of stellar-mass bursting sources in the millihertz band for future LISA detections. Because of their long lifetime ($\sim 10^{7}\rm\, yr$) and promising detectability, the number of millihertz bursting sources can be large in the local universe. For example, based on our estimates, there will be $\sim 3 - 45$ bursting binary black holes in the Milky Way, with $\sim 10^{2} - 10^{4}$ bursts detected during the LISA mission. Moreover, we find that the number of bursting sources strongly depends on their formation history. If certain regions undergo active formation of compact object binaries in the recent few million years, there will be a significantly higher bursting source fraction. Thus, the detection of millihertz GW bursts not only serves as a clue for distinguishing different formation channels, but also helps us understand the star formation history in different regions of the Milky Way.
△ Less
Submitted 21 February, 2024; v1 submitted 29 September, 2023;
originally announced October 2023.
-
PEM: Representing Binary Program Semantics for Similarity Analysis via a Probabilistic Execution Model
Authors:
Xiangzhe Xu,
Zhou Xuan,
Shiwei Feng,
Siyuan Cheng,
Yapeng Ye,
Qingkai Shi,
Guanhong Tao,
Le Yu,
Zhuo Zhang,
Xiangyu Zhang
Abstract:
Binary similarity analysis determines if two binary executables are from the same source program. Existing techniques leverage static and dynamic program features and may utilize advanced Deep Learning techniques. Although they have demonstrated great potential, the community believes that a more effective representation of program semantics can further improve similarity analysis. In this paper,…
▽ More
Binary similarity analysis determines if two binary executables are from the same source program. Existing techniques leverage static and dynamic program features and may utilize advanced Deep Learning techniques. Although they have demonstrated great potential, the community believes that a more effective representation of program semantics can further improve similarity analysis. In this paper, we propose a new method to represent binary program semantics. It is based on a novel probabilistic execution engine that can effectively sample the input space and the program path space of subject binaries. More importantly, it ensures that the collected samples are comparable across binaries, addressing the substantial variations of input specifications. Our evaluation on 9 real-world projects with 35k functions, and comparison with 6 state-of-the-art techniques show that PEM can achieve a precision of 96% with common settings, outperforming the baselines by 10-20%.
△ Less
Submitted 29 August, 2023; v1 submitted 29 August, 2023;
originally announced August 2023.
-
AdPE: Adversarial Positional Embeddings for Pretraining Vision Transformers via MAE+
Authors:
Xiao Wang,
Ying Wang,
Ziwei Xuan,
Guo-Jun Qi
Abstract:
Unsupervised learning of vision transformers seeks to pretrain an encoder via pretext tasks without labels. Among them is the Masked Image Modeling (MIM) aligned with pretraining of language transformers by predicting masked patches as a pretext task. A criterion in unsupervised pretraining is the pretext task needs to be sufficiently hard to prevent the transformer encoder from learning trivial l…
▽ More
Unsupervised learning of vision transformers seeks to pretrain an encoder via pretext tasks without labels. Among them is the Masked Image Modeling (MIM) aligned with pretraining of language transformers by predicting masked patches as a pretext task. A criterion in unsupervised pretraining is the pretext task needs to be sufficiently hard to prevent the transformer encoder from learning trivial low-level features not generalizable well to downstream tasks. For this purpose, we propose an Adversarial Positional Embedding (AdPE) approach -- It distorts the local visual structures by perturbing the position encodings so that the learned transformer cannot simply use the locally correlated patches to predict the missing ones. We hypothesize that it forces the transformer encoder to learn more discriminative features in a global context with stronger generalizability to downstream tasks. We will consider both absolute and relative positional encodings, where adversarial positions can be imposed both in the embedding mode and the coordinate mode. We will also present a new MAE+ baseline that brings the performance of the MIM pretraining to a new level with the AdPE. The experiments demonstrate that our approach can improve the fine-tuning accuracy of MAE by $0.8\%$ and $0.4\%$ over 1600 epochs of pretraining ViT-B and ViT-L on Imagenet1K. For the transfer learning task, it outperforms the MAE with the ViT-B backbone by $2.6\%$ in mIoU on ADE20K, and by $3.2\%$ in AP$^{bbox}$ and $1.6\%$ in AP$^{mask}$ on COCO, respectively. These results are obtained with the AdPE being a pure MIM approach that does not use any extra models or external datasets for pretraining. The code is available at https://github.com/maple-research-lab/AdPE.
△ Less
Submitted 13 March, 2023;
originally announced March 2023.
-
Detecting Accelerating Eccentric Binaries in the LISA Band
Authors:
Zeyuan Xuan,
Smadar Naoz,
Xian Chen
Abstract:
Many gravitational wave (GW) sources in the LISA band are expected to have non-negligible eccentricity. Furthermore, many of them can undergo acceleration because they reside in the presence of a tertiary. Here we develop analytical and numerical methods to quantify how the compact binary's eccentricity enhances the detection of its peculiar acceleration. We show that the general relativistic prec…
▽ More
Many gravitational wave (GW) sources in the LISA band are expected to have non-negligible eccentricity. Furthermore, many of them can undergo acceleration because they reside in the presence of a tertiary. Here we develop analytical and numerical methods to quantify how the compact binary's eccentricity enhances the detection of its peculiar acceleration. We show that the general relativistic precession pattern can disentangle the binary's acceleration-induced frequency shift from the chirp-mass-induced frequency shift in GW template fitting, thus relaxing the signal-to-noise ratio requirement for distinguishing the acceleration by a factor of $10\sim100$. Moreover, by adopting the GW templates of the accelerating eccentric compact binaries, we can enhance the acceleration measurement accuracy by a factor of $\sim100$, compared to the zero-eccentricity case, and detect the source's acceleration even if it does not change during the observational time. For example, a stellar-mass binary black hole (BBH) with moderate eccentricity in the LISA band yields an error of the acceleration measurement $\sim10^{-7}m\cdot s^{-2}$ for $\rm{SNR}=20$ and observational time of $4$ yrs. In this example, we can measure the BBHs' peculiar acceleration even when it is $\sim1\rm pc$ away from a $4\times 10^{6}\rm M_{\odot}$ SMBH. Our results highlight the importance of eccentricity to the LISA-band sources and show the necessity of develo** GW templates for accelerating eccentric compact binaries.
△ Less
Submitted 11 January, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
NeuDep: Neural Binary Memory Dependence Analysis
Authors:
Kexin Pei,
Dongdong She,
Michael Wang,
Scott Geng,
Zhou Xuan,
Yaniv David,
Junfeng Yang,
Suman Jana,
Baishakhi Ray
Abstract:
Determining whether multiple instructions can access the same memory location is a critical task in binary analysis. It is challenging as statically computing precise alias information is undecidable in theory. The problem aggravates at the binary level due to the presence of compiler optimizations and the absence of symbols and types. Existing approaches either produce significant spurious depend…
▽ More
Determining whether multiple instructions can access the same memory location is a critical task in binary analysis. It is challenging as statically computing precise alias information is undecidable in theory. The problem aggravates at the binary level due to the presence of compiler optimizations and the absence of symbols and types. Existing approaches either produce significant spurious dependencies due to conservative analysis or scale poorly to complex binaries.
We present a new machine-learning-based approach to predict memory dependencies by exploiting the model's learned knowledge about how binary programs execute. Our approach features (i) a self-supervised procedure that pretrains a neural net to reason over binary code and its dynamic value flows through memory addresses, followed by (ii) supervised finetuning to infer the memory dependencies statically. To facilitate efficient learning, we develop dedicated neural architectures to encode the heterogeneous inputs (i.e., code, data values, and memory addresses from traces) with specific modules and fuse them with a composition learning strategy.
We implement our approach in NeuDep and evaluate it on 41 popular software projects compiled by 2 compilers, 4 optimizations, and 4 obfuscation passes. We demonstrate that NeuDep is more precise (1.5x) and faster (3.5x) than the current state-of-the-art. Extensive probing studies on security-critical reverse engineering tasks suggest that NeuDep understands memory access patterns, learns function signatures, and is able to match indirect calls. All these tasks either assist or benefit from inferring memory dependencies. Notably, NeuDep also outperforms the current state-of-the-art on these tasks.
△ Less
Submitted 4 October, 2022;
originally announced October 2022.
-
Molecular-scale Integration of Multi-modal Sensing and Neuromorphic Computing with Organic Electrochemical Transistors
Authors:
Shijie Wang,
Xi Chen,
Chao Zhao,
Yuxin Kong,
Baojun Lin,
Yongyi Wu,
Zhaozhao Bi,
Ziyi Xuan,
Tao Li,
Yuxiang Li,
Wei Zhang,
En Ma,
Zhongrui Wang,
Wei Ma
Abstract:
Abstract: Bionic learning with fused sensing, memory and processing functions outperforms artificial neural networks running on silicon chips in terms of efficiency and footprint. However, digital hardware implementation of bionic learning suffers from device heterogeneity in sensors and processing cores, which incurs large hardware, energy and time overheads. Here, we present a universal solution…
▽ More
Abstract: Bionic learning with fused sensing, memory and processing functions outperforms artificial neural networks running on silicon chips in terms of efficiency and footprint. However, digital hardware implementation of bionic learning suffers from device heterogeneity in sensors and processing cores, which incurs large hardware, energy and time overheads. Here, we present a universal solution to simultaneously perform multi-modal sensing, memory and processing using organic electrochemical transistors with designed architecture and tailored channel morphology, selective ion injection into the crystalline/amorphous regions. The resultant device work as either a volatile receptor that shows multi-modal sensing, or a non-volatile synapse that features record-high 10-bit analog states, low switching stochasticity and good retention without the integration of any extra devices. Homogeneous integration of such devices enables bionic learning functions such as conditioned reflex and real-time cardiac disease diagnose via reservoir computing, illustrating the promise for future smart edge health informatics.
△ Less
Submitted 19 February, 2022; v1 submitted 9 February, 2022;
originally announced February 2022.
-
Dual-Flattening Transformers through Decomposed Row and Column Queries for Semantic Segmentation
Authors:
Ying Wang,
Chiuman Ho,
Wenju Xu,
Ziwei Xuan,
Xudong Liu,
Guo-Jun Qi
Abstract:
It is critical to obtain high resolution features with long range dependency for dense prediction tasks such as semantic segmentation. To generate high-resolution output of size $H\times W$ from a low-resolution feature map of size $h\times w$ ($hw\ll HW$), a naive dense transformer incurs an intractable complexity of $\mathcal{O}(hwHW)$, limiting its application on high-resolution dense predictio…
▽ More
It is critical to obtain high resolution features with long range dependency for dense prediction tasks such as semantic segmentation. To generate high-resolution output of size $H\times W$ from a low-resolution feature map of size $h\times w$ ($hw\ll HW$), a naive dense transformer incurs an intractable complexity of $\mathcal{O}(hwHW)$, limiting its application on high-resolution dense prediction. We propose a Dual-Flattening Transformer (DFlatFormer) to enable high-resolution output by reducing complexity to $\mathcal{O}(hw(H+W))$ that is multiple orders of magnitude smaller than the naive dense transformer. Decomposed queries are presented to retrieve row and column attentions tractably through separate transformers, and their outputs are combined to form a dense feature map at high resolution. To this end, the input sequence fed from an encoder is row-wise and column-wise flattened to align with decomposed queries by preserving their row and column structures, respectively. Row and column transformers also interact with each other to capture their mutual attentions with the spatial crossings between rows and columns. We also propose to perform attentions through efficient grou** and pooling to further reduce the model complexity. Extensive experiments on ADE20K and Cityscapes datasets demonstrate the superiority of the proposed dual-flattening transformer architecture with higher mIoUs.
△ Less
Submitted 22 January, 2022;
originally announced January 2022.
-
R2RNet: Low-light Image Enhancement via Real-low to Real-normal Network
Authors:
Jiang Hai,
Zhu Xuan,
Songchen Han,
Ren Yang,
Yutong Hao,
Fengzhu Zou,
Fang Lin
Abstract:
Images captured in weak illumination conditions could seriously degrade the image quality. Solving a series of degradation of low-light images can effectively improve the visual quality of images and the performance of high-level visual tasks. In this study, a novel Retinex-based Real-low to Real-normal Network (R2RNet) is proposed for low-light image enhancement, which includes three subnets: a D…
▽ More
Images captured in weak illumination conditions could seriously degrade the image quality. Solving a series of degradation of low-light images can effectively improve the visual quality of images and the performance of high-level visual tasks. In this study, a novel Retinex-based Real-low to Real-normal Network (R2RNet) is proposed for low-light image enhancement, which includes three subnets: a Decom-Net, a Denoise-Net, and a Relight-Net. These three subnets are used for decomposing, denoising, contrast enhancement and detail preservation, respectively. Our R2RNet not only uses the spatial information of the image to improve the contrast but also uses the frequency information to preserve the details. Therefore, our model acheived more robust results for all degraded images. Unlike most previous methods that were trained on synthetic images, we collected the first Large-Scale Real-World paired low/normal-light images dataset (LSRW dataset) to satisfy the training requirements and make our model have better generalization performance in real-world scenes. Extensive experiments on publicly available datasets demonstrated that our method outperforms the existing state-of-the-art methods both quantitatively and visually. In addition, our results showed that the performance of the high-level visual task (i.e. face detection) can be effectively improved by using the enhanced results obtained by our method in low-light conditions. Our codes and the LSRW dataset are available at: https://github.com/abcdef2000/R2RNet.
△ Less
Submitted 11 November, 2021; v1 submitted 28 June, 2021;
originally announced June 2021.
-
Trex: Learning Execution Semantics from Micro-Traces for Binary Similarity
Authors:
Kexin Pei,
Zhou Xuan,
Junfeng Yang,
Suman Jana,
Baishakhi Ray
Abstract:
Detecting semantically similar functions -- a crucial analysis capability with broad real-world security usages including vulnerability detection, malware lineage, and forensics -- requires understanding function behaviors and intentions. This task is challenging as semantically similar functions can be implemented differently, run on different architectures, and compiled with diverse compiler opt…
▽ More
Detecting semantically similar functions -- a crucial analysis capability with broad real-world security usages including vulnerability detection, malware lineage, and forensics -- requires understanding function behaviors and intentions. This task is challenging as semantically similar functions can be implemented differently, run on different architectures, and compiled with diverse compiler optimizations or obfuscations. Most existing approaches match functions based on syntactic features without understanding the functions' execution semantics.
We present Trex, a transfer-learning-based framework, to automate learning execution semantics explicitly from functions' micro-traces and transfer the learned knowledge to match semantically similar functions. Our key insight is that these traces can be used to teach an ML model the execution semantics of different sequences of instructions. We thus train the model to learn execution semantics from the functions' micro-traces, without any manual labeling effort. We then develop a novel neural architecture to learn execution semantics from micro-traces, and we finetune the pretrained model to match semantically similar functions.
We evaluate Trex on 1,472,066 function binaries from 13 popular software projects. These functions are from different architectures and compiled with various optimizations and obfuscations. Trex outperforms the state-of-the-art systems by 7.8%, 7.2%, and 14.3% in cross-architecture, optimization, and obfuscation function matching, respectively. Ablation studies show that the pretraining significantly boosts the function matching performance, underscoring the importance of learning execution semantics.
△ Less
Submitted 26 April, 2021; v1 submitted 15 December, 2020;
originally announced December 2020.
-
Degeneracy between mass and peculiar acceleration for the double white dwarfs in the LISA band
Authors:
Zeyuan Xuan,
Peng Peng,
Xian Chen
Abstract:
Mass and distance are fundamental quantities to measure in gravitational-wave (GW) astronomy. However, recent studies suggest that the measurement may be biased due to the acceleration of GW source. Here we develop an analytical method to quantify such a bias induced by a tertiary on a double white dwarf (DWD), since DWDs are the most common GW sources in the milli-Hertz band. We show that in a la…
▽ More
Mass and distance are fundamental quantities to measure in gravitational-wave (GW) astronomy. However, recent studies suggest that the measurement may be biased due to the acceleration of GW source. Here we develop an analytical method to quantify such a bias induced by a tertiary on a double white dwarf (DWD), since DWDs are the most common GW sources in the milli-Hertz band. We show that in a large parameter space the mass is degenerate with the peculiar acceleration, so that from the waveform we can only retrieve a mass of ${\cal M}(1+Γ)^{3/5}$, where ${\cal M}$ is the real chirp mass of the DWD and $Γ$ is a dimensionless factor proportional to the peculiar acceleration. Based on our analytical method, we conduct mock observation of DWDs by the Laser Interferometer Space Antenna (LISA). We find that in about $9\%$ of the cases the measured chirp mass is biased due to the presence of a tertiary by $(5-30)\%$. Even more extreme cases are found in about a dozen DWDs and they may be misclassified as double neutron stars, binary black holes, DWDs undergoing mass transfer, or even binaries containing lower-mass-gap objects and primordial black holes. The bias in mass also affects the measurement of distance, resulting in a seemingly over-density of DWDs within a heliocentric distance of $1$ kpc as well as beyond $100$ kpc. Our result highlights the necessity of modeling the astrophysical environments of GW sources to retrieve their correct physical parameters.
△ Less
Submitted 30 November, 2020;
originally announced December 2020.
-
Exciting modes due to the aberration of gravitational waves: Measurability for extreme-mass-ratio inspirals
Authors:
Alejandro Torres-Orjuela,
Pau Amaro Seoane,
Zeyuan Xuan,
Alvin J. K. Chua,
María J. B. Rosell,
Xian Chen
Abstract:
Gravitational waves from a source moving relative to us can suffer from special-relativistic effects such as aberration. The required velocities for these to be significant are on the order of $1000\,\textrm{km s}^{-1}$. This value corresponds to the velocity dispersion that one finds in clusters of galaxies. Hence, we expect a large number of gravitational-wave sources to have such effects imprin…
▽ More
Gravitational waves from a source moving relative to us can suffer from special-relativistic effects such as aberration. The required velocities for these to be significant are on the order of $1000\,\textrm{km s}^{-1}$. This value corresponds to the velocity dispersion that one finds in clusters of galaxies. Hence, we expect a large number of gravitational-wave sources to have such effects imprinted in their signals. In particular, the signal from a moving source will have its higher modes excited, i.e., $(3,3)$ and beyond. We derive expressions describing this effect, and study its measurability for the specific case of a circular, non-spinning extreme-mass-ratio inspiral. We find that the excitation of higher modes by a peculiar velocity of $1000\,\textrm{km\,s}^{-1}$ is detectable for such inspirals with signal-to-noise ratios of $\gtrsim20$. Using a Fisher matrix analysis, we show that the velocity of the source can be measured to a precision of just a few percent for a signal-to-noise ratio of 100. If the motion of the source is ignored parameter estimates could be biased, e.g., the estimated masses of the components through a Doppler shift. Conversely, by including this effect in waveform models, we could measure the velocity dispersion of clusters of galaxies at distances inaccessible to light.
△ Less
Submitted 11 July, 2021; v1 submitted 29 October, 2020;
originally announced October 2020.
-
PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution
Authors:
Hao Dou,
Chen Chen,
Xiyuan Hu,
Zuxing Xuan,
Zhisen Hu,
Silong Peng
Abstract:
Generative Adversarial Networks (GAN) have been employed for face super resolution but they bring distorted facial details easily and still have weakness on recovering realistic texture. To further improve the performance of GAN based models on super-resolving face images, we propose PCA-SRGAN which pays attention to the cumulative discrimination in the orthogonal projection space spanned by PCA p…
▽ More
Generative Adversarial Networks (GAN) have been employed for face super resolution but they bring distorted facial details easily and still have weakness on recovering realistic texture. To further improve the performance of GAN based models on super-resolving face images, we propose PCA-SRGAN which pays attention to the cumulative discrimination in the orthogonal projection space spanned by PCA projection matrix of face data. By feeding the principal component projections ranging from structure to details into the discriminator, the discrimination difficulty will be greatly alleviated and the generator can be enhanced to reconstruct clearer contour and finer texture, helpful to achieve the high perception and low distortion eventually. This incremental orthogonal projection discrimination has ensured a precise optimization procedure from coarse to fine and avoids the dependence on the perceptual regularization. We conduct experiments on CelebA and FFHQ face datasets. The qualitative visual effect and quantitative evaluation have demonstrated the overwhelming performance of our model over related works.
△ Less
Submitted 28 August, 2020; v1 submitted 1 May, 2020;
originally announced May 2020.
-
Fake massive black holes in the milli-Hertz gravitational-wave band
Authors:
Xian Chen,
Ze-Yuan Xuan,
Peng Peng
Abstract:
In gravitational wave (GW) astronomy accurate measurement of the source parameters, such as mass, relies on accurate waveform templates. Currently, the templates are developed assuming that the source, such as a binary black hole (BBH), is residing in a vacuum. However, astrophysical models predict that BBHs could form in gaseous environments, such as common envelops, stellar cores, and accretion…
▽ More
In gravitational wave (GW) astronomy accurate measurement of the source parameters, such as mass, relies on accurate waveform templates. Currently, the templates are developed assuming that the source, such as a binary black hole (BBH), is residing in a vacuum. However, astrophysical models predict that BBHs could form in gaseous environments, such as common envelops, stellar cores, and accretion disks of active galactic nuclei. Here we revisit the impact of gas on the GW waveforms of stellar-mass BBHs with a focus on the early inspiral phase when the GW frequency is around milli-Hertz. We show that for these BBHs, gas friction could dominate the dynamical evolution and hence duplicate chirp signals. The relevant hydrodynamical timescale, $τ_{\rm gas}$, could be much shorter than the GW radiation timescale, $τ_{\rm gw}$, in the above astrophysical scenarios. As a result, the observable chirp mass is higher than the real one by a factor of $(1+τ_{\rm gw}/τ_{\rm gas})^{3/5}$ if the gas effect is ignored in the data analysis. Such an error also results in an overestimation of the source distance by a factor of $(1+τ_{\rm gw}/τ_{\rm gas})$. By performing matched-filtering analysis in the milli-Hertz band, we prove that the gas-dominated signals are practically indistinguishable from the chirp signals of those more massive BBHs residing in a vacuum environment. Such fake massive objects in the milli-Hertz band, if not appropriately accounted for in the future, may alter our understanding of the formation, evolution, and detection of BBHs.
△ Less
Submitted 19 March, 2020;
originally announced March 2020.
-
FGN: Fusion Glyph Network for Chinese Named Entity Recognition
Authors:
Zhenyu Xuan,
Rui Bao,
Shengyi Jiang
Abstract:
Chinese NER is a challenging task. As pictographs, Chinese characters contain latent glyph information, which is often overlooked. In this paper, we propose the FGN, Fusion Glyph Network for Chinese NER. Except for adding glyph information, this method may also add extra interactive information with the fusion mechanism. The major innovations of FGN include: (1) a novel CNN structure called CGS-CN…
▽ More
Chinese NER is a challenging task. As pictographs, Chinese characters contain latent glyph information, which is often overlooked. In this paper, we propose the FGN, Fusion Glyph Network for Chinese NER. Except for adding glyph information, this method may also add extra interactive information with the fusion mechanism. The major innovations of FGN include: (1) a novel CNN structure called CGS-CNN is proposed to capture both glyph information and interactive information between glyphs from neighboring characters. (2) we provide a method with sliding window and Slice-Attention to fuse the BERT representation and glyph representation for a character, which may capture potential interactive knowledge between context and glyph. Experiments are conducted on four NER datasets, showing that FGN with LSTM-CRF as tagger achieves new state-of-the-arts performance for Chinese NER. Further, more experiments are conducted to investigate the influences of various components and settings in FGN.
△ Less
Submitted 8 October, 2020; v1 submitted 15 January, 2020;
originally announced January 2020.
-
Characterization of an Ionization Readout Tile for nEXO
Authors:
nEXO Collaboration,
M. Jewell,
A. Schubert,
W. R. Cen,
J. Dalmasson,
R. DeVoe,
L. Fabris,
G. Gratta,
A. Jamil,
G. Li,
A. Odian,
M. Patel,
A. Pocar,
D. Qiu,
Q. Wang,
L. J. Wen,
J. B. Albert,
G. Anton,
I. J. Arnquist,
I. Badhrees,
P. Barbeau,
D. Beck,
V. Belov,
F. Bourque,
J. P. Brodsky
, et al. (120 additional authors not shown)
Abstract:
A new design for the anode of a time projection chamber, consisting of a charge-detecting "tile", is investigated for use in large scale liquid xenon detectors. The tile is produced by depositing 60 orthogonal metal charge-collecting strips, 3~mm wide, on a 10~\si{\cm} $\times$ 10~\si{\cm} fused-silica wafer. These charge tiles may be employed by large detectors, such as the proposed tonne-scale n…
▽ More
A new design for the anode of a time projection chamber, consisting of a charge-detecting "tile", is investigated for use in large scale liquid xenon detectors. The tile is produced by depositing 60 orthogonal metal charge-collecting strips, 3~mm wide, on a 10~\si{\cm} $\times$ 10~\si{\cm} fused-silica wafer. These charge tiles may be employed by large detectors, such as the proposed tonne-scale nEXO experiment to search for neutrinoless double-beta decay. Modular by design, an array of tiles can cover a sizable area. The width of each strip is small compared to the size of the tile, so a Frisch grid is not required. A grid-less, tiled anode design is beneficial for an experiment such as nEXO, where a wire tensioning support structure and Frisch grid might contribute radioactive backgrounds and would have to be designed to accommodate cycling to cryogenic temperatures. The segmented anode also reduces some degeneracies in signal reconstruction that arise in large-area crossed-wire time projection chambers. A prototype tile was tested in a cell containing liquid xenon. Very good agreement is achieved between the measured ionization spectrum of a $^{207}$Bi source and simulations that include the microphysics of recombination in xenon and a detailed modeling of the electrostatic field of the detector. An energy resolution $σ/E$=5.5\% is observed at 570~\si{keV}, comparable to the best intrinsic ionization-only resolution reported in literature for liquid xenon at 936~V/\si{cm}.
△ Less
Submitted 19 January, 2018; v1 submitted 13 October, 2017;
originally announced October 2017.
-
Fast Esca** Sets of meromorphic functions
Authors:
Jianhua Zheng,
Zuxing Xuan
Abstract:
In this paper, we give a definition of Eremenko's point of a meromorphic function with infinitely many poles and a condition for its existence in narrow annuli in terms of a covering theorem of annulus.
In this paper, we give a definition of Eremenko's point of a meromorphic function with infinitely many poles and a condition for its existence in narrow annuli in terms of a covering theorem of annulus.
△ Less
Submitted 16 February, 2020; v1 submitted 2 July, 2016;
originally announced July 2016.
-
A new geometric characterization of the Julia set
Authors:
Xiao Yao,
Daochun Sun,
Zuxing Xuan
Abstract:
This article concerns a new geometric characterization of the Julia set. By using Ahlfors-Shimizu's characteristic, we establish some growth results which indicates the characterization of the Julia set. The main technique is to estimate the lower bound of $S(f^n,U)$, where $U$ is an open neighbourhood of some point in $\mathcal{J}(f)$.
This article concerns a new geometric characterization of the Julia set. By using Ahlfors-Shimizu's characteristic, we establish some growth results which indicates the characterization of the Julia set. The main technique is to estimate the lower bound of $S(f^n,U)$, where $U$ is an open neighbourhood of some point in $\mathcal{J}(f)$.
△ Less
Submitted 16 December, 2015;
originally announced December 2015.
-
Simulating City-level Airborne Infectious Diseases
Authors:
Mei Shan,
Zhou Xuan,
Zhu Yifan,
Zu Zhenghu,
Zheng Tao,
A. V. Boukhanovsky,
P. M. A Sloot
Abstract:
With the exponential growth in the world population and the constant increase in human mobility, the danger of outbreaks of epidemics is rising. Especially in high density urban areas such as public transport and transfer points, where people come in close proximity of each other, we observe a dramatic increase in the transmission of airborne viruses and related pathogens. It is essential to have…
▽ More
With the exponential growth in the world population and the constant increase in human mobility, the danger of outbreaks of epidemics is rising. Especially in high density urban areas such as public transport and transfer points, where people come in close proximity of each other, we observe a dramatic increase in the transmission of airborne viruses and related pathogens. It is essential to have a good understanding of the `transmission highways' in such areas, in order to prevent or to predict the spreading of infectious diseases. The approach we take is to combine as much information as is possible, from all relevant sources and integrate this in a simulation environment that allows for scenario testing and decision support. In this paper we lay out a novel approach to study Urban Airborne Disease spreading by combining traffic information, with geo-spatial data, infection dynamics and spreading characteristics.
△ Less
Submitted 30 December, 2011;
originally announced January 2012.
-
Information Filtering via Implicit Trust-based Network
Authors:
Zhao-Guo Xuan,
Zhan Li,
Jian-Guo Liu
Abstract:
Based on the user-item bipartite network, collaborative filtering (CF) recommender systems predict users' interests according to their history collections, which is a promising way to solve the information exploration problem. However, CF algorithm encounters cold start and sparsity problems. The trust-based CF algorithm is implemented by collecting the users' trust statements, which is time-consu…
▽ More
Based on the user-item bipartite network, collaborative filtering (CF) recommender systems predict users' interests according to their history collections, which is a promising way to solve the information exploration problem. However, CF algorithm encounters cold start and sparsity problems. The trust-based CF algorithm is implemented by collecting the users' trust statements, which is time-consuming and must use users' private friendship information. In this paper, we present a novel measurement to calculate users' implicit trust-based correlation by taking into account their average ratings, rating ranges, and the number of common rated items. By applying the similar idea to the items, a item-based CF algorithm is constructed. The simulation results on three benchmark data sets show that the performances of both user-based and item-based algorithms could be enhanced greatly. Finally, a hybrid algorithm is constructed by integrating the user-based and item-based algorithms, the simulation results indicate that hybrid algorithm outperforms the state-of-the-art methods. Specifically, it can not only provide more accurate recommendations, but also alleviate the cold start problem.
△ Less
Submitted 11 December, 2011;
originally announced December 2011.
-
Degree correlation effect of bipartite network on personalized recommendation
Authors:
Jian-Guo Liu,
Tao Zhou,
Zhao-Guo Xuan,
Hong-An Che,
Bing-Hong Wang,
Yi-Cheng Zhang
Abstract:
In this paper, by introducing a new user similarity index base on the diffusion process, we propose a modified collaborative filtering (MCF) algorithm, which has remarkably higher accuracy than the standard collaborative filtering. In the proposed algorithm, the degree correlation between users and objects is taken into account and embedded into the similarity index by a tunable parameter. The n…
▽ More
In this paper, by introducing a new user similarity index base on the diffusion process, we propose a modified collaborative filtering (MCF) algorithm, which has remarkably higher accuracy than the standard collaborative filtering. In the proposed algorithm, the degree correlation between users and objects is taken into account and embedded into the similarity index by a tunable parameter. The numerical simulation on a benchmark data set shows that the algorithmic accuracy of the MCF, measured by the average ranking score, is further improved by 18.19% in the optimal case. In addition, two significant criteria of algorithmic performance, diversity and popularity, are also taken into account. Numerical results show that the presented algorithm can provide more diverse and less popular recommendations, for example, when the recommendation list contains 10 objects, the diversity, measured by the hamming distance, is improved by 21.90%.
△ Less
Submitted 7 July, 2009;
originally announced July 2009.
-
Common Borel radius of an algebroid function and its derivative
Authors:
Nan Wu,
Zuxing Xuan
Abstract:
In this article, by comparing the characteristic functions, we prove that for any $ν$-valued algebroid function $w(z)$ defined in the unit disk with $\limsup_{r\to1-}T(r,w)/\log\frac{1}{1-r}=\infty$ and the hyper order $ρ_2(w)=0$, the distribution of the Borel radius of $w(z)$ and $w'(z)$ is the same. This is the extension of G. Valiron's conjecture for the meromorphic functions defined in…
▽ More
In this article, by comparing the characteristic functions, we prove that for any $ν$-valued algebroid function $w(z)$ defined in the unit disk with $\limsup_{r\to1-}T(r,w)/\log\frac{1}{1-r}=\infty$ and the hyper order $ρ_2(w)=0$, the distribution of the Borel radius of $w(z)$ and $w'(z)$ is the same. This is the extension of G. Valiron's conjecture for the meromorphic functions defined in $\widehat{\mathbb{C}}$.
△ Less
Submitted 24 June, 2009;
originally announced June 2009.
-
Weighted Network of Chinese Nature Science Basic Research
Authors:
Jian-Guo Liu,
Zhao-Guo Xuan,
Yan-Zhong Dang,
Qiang Guo,
Zhong-Tuo Wang
Abstract:
Using the requisition papers of Chinese Nature Science Basic Research in management and information department, we construct the weighted network of research areas({\bf WRAN}) represented by the subject codes. In WRAN, two research areas are considered connected if they have been filled in at least one requisition paper. The edge weight is defined as the number of requisition papers which have f…
▽ More
Using the requisition papers of Chinese Nature Science Basic Research in management and information department, we construct the weighted network of research areas({\bf WRAN}) represented by the subject codes. In WRAN, two research areas are considered connected if they have been filled in at least one requisition paper. The edge weight is defined as the number of requisition papers which have filled in the same pairs of codes. The node strength is defined as the number of requisition papers which have filled in this code, including the papers which have filled in it only. Here we study a variety of nonlocal statistics for these networks, such as typical distances between research areas through the network, and measures of centrality such as betweenness. These statistics characteristics can illuminate the global development trend of Chinese scientific study, it is also helpful to adjust the code system to reflect the real status more accurately. Finally, we present a plausible model for the formation and structure of networks with the observed properties.
△ Less
Submitted 9 June, 2006;
originally announced June 2006.
-
Self-learning Mutual Selection Model for Weighted Networks
Authors:
Jian-Guo Liu,
Yan-Zhong Dang,
Wen-Xu Wang,
Zhong-Tuo Wang,
Tao Zhou,
Bing-Hong Wang,
Qiang Guo,
Zhao-Guo Xuan,
Shao-Hua Jiang,
Ming-Wei Zhao
Abstract:
In this paper, we propose a self-learning mutual selection model to characterize weighted evolving networks. By introducing the self-learning probability $p$ and the general mutual selection mechanism, which is controlled by the parameter $m$, the model can reproduce scale-free distributions of degree, weight and strength, as found in many real systems. The simulation results are consistent with…
▽ More
In this paper, we propose a self-learning mutual selection model to characterize weighted evolving networks. By introducing the self-learning probability $p$ and the general mutual selection mechanism, which is controlled by the parameter $m$, the model can reproduce scale-free distributions of degree, weight and strength, as found in many real systems. The simulation results are consistent with the theoretical predictions approximately. Interestingly, we obtain the nontrivial clustering coefficient $C$ and tunable degree assortativity $r$, depending on the parameters $m$ and $p$. The model can unify the characterization of both assortative and disassortative weighted networks. Also, we find that self-learning may contribute to the assortative mixing of social networks.
△ Less
Submitted 30 December, 2005;
originally announced December 2005.