
Showing 1–33 of 33 results for author: Lin, D

Searching in archive eess.
  1. arXiv:2402.17645  [pdf, other]

    cs.SD cs.AI cs.CL eess.AS

    SongComposer: A Large Language Model for Lyric and Melody Composition in Song Generation

    Authors: Shuangrui Ding, Zihan Liu, Xiaoyi Dong, Pan Zhang, Rui Qian, Conghui He, Dahua Lin, Jiaqi Wang

    Abstract: We present SongComposer, an innovative LLM designed for song composition. It can understand and generate melodies and lyrics in symbolic song representations by leveraging the capabilities of LLMs. Existing music-related LLMs treat music as quantized audio signals, but such implicit encoding leads to inefficient encoding and poor flexibility. In contrast, we resort to symbolic song represen…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: project page: https://pjlab-songcomposer.github.io/ code: https://github.com/pjlab-songcomposer/songcomposer

  2. arXiv:2312.15633  [pdf, other]

    cs.CV eess.IV

    MuLA-GAN: Multi-Level Attention GAN for Enhanced Underwater Visibility

    Authors: Ahsan Baidar Bakht, Zikai Jia, Muhayy ud Din, Waseem Akram, Lyes Saad Soud, Lakmal Seneviratne, Defu Lin, Shaoming He, Irfan Hussain

    Abstract: The underwater environment presents unique challenges, including color distortions, reduced contrast, and blurriness, hindering accurate analysis. In this work, we introduce MuLA-GAN, a novel approach that leverages the synergistic power of Generative Adversarial Networks (GANs) and Multi-Level Attention mechanisms for comprehensive underwater image enhancement. The integration of Multi-Level Atte…

    Submitted 25 December, 2023; originally announced December 2023.

  3. arXiv:2312.09452  [pdf, other]

    eess.SP cs.IT

    Efficient Multi-Pair IoT Communication with Holographically Enhanced Meta-Surfaces Leveraging OAM Beams: Bridging Theory and Prototype

    Authors: Yufei Zhao, Yong Liang Guan, Afkar Mohamed Ismail, Gaohua Ju, Deyu Lin, Yilong Lu, Chau Yuen

    Abstract: Meta-surfaces, also known as Reconfigurable Intelligent Surfaces (RIS), have emerged as a cost-effective, low power consumption, and flexible solution for enabling multiple applications in Internet of Things (IoT). However, in the context of meta-surface-assisted multi-pair IoT communications, significant interference issues often arise among multiple channels. This issue is particularly pronounc…

    Submitted 18 November, 2023; originally announced December 2023.

    Comments: Meta-surface, RIS, Internet-of-Things (IoT), Line-of-Sight (LoS), Orbital Angular Momentum (OAM), holographic communications, multi-user

  4. arXiv:2311.05609  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS

    What Do I Hear? Generating Sounds for Visuals with ChatGPT

    Authors: David Chuan-En Lin, Nikolas Martelaro

    Abstract: This short paper introduces a workflow for generating realistic soundscapes for visual media. In contrast to prior work, which primarily focuses on matching sounds for on-screen visuals, our approach extends to suggesting sounds that may not be immediately visible but are essential to crafting a convincing and immersive auditory environment. Our key insight is leveraging the reasoning capabilities o…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Demo: http://soundify.cc

  5. arXiv:2309.07178  [pdf]

    q-bio.QM cs.AI cs.LG eess.SP

    CloudBrain-NMR: An Intelligent Cloud Computing Platform for NMR Spectroscopy Processing, Reconstruction and Analysis

    Authors: Di Guo, Si** Li, Jun Liu, Zhangren Tu, Tianyu Qiu, Jingjing Xu, Liubin Feng, Donghai Lin, Qing Hong, Mei** Lin, Yanqin Lin, Xiaobo Qu

    Abstract: Nuclear Magnetic Resonance (NMR) spectroscopy has served as a powerful analytical tool for studying molecular structure and dynamics in chemistry and biology. However, the processing of raw data acquired from NMR spectrometers and subsequent quantitative analysis involves various specialized tools, which necessitates comprehensive knowledge in programming and NMR. Particularly, the emerging deep l…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 11 pages, 13 figures

  6. arXiv:2306.07263  [pdf, other]

    eess.SY

    Enlarging Stability Region of Urban Networks with Imminent Supply Prediction

    Authors: Dianchao Lin, Li Li

    Abstract: Stability region is a key index to characterize a dynamic processing system's ability to handle incoming demands. It is a multidimensional space when the system has multiple OD pairs where their service rates interact. Urban traffic network is such a system. Traffic congestion appears when its demand approaches or exceeds the upper frontier of its stability region. In this decade, with the rapid d…

    Submitted 8 April, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

  7. An Efficient Safety-oriented Car-following Model for Connected Automated Vehicles Considering Discrete Signals

    Authors: Dianchao Lin, Li Li

    Abstract: With the rapid development of Connected and Automated Vehicle (CAV) technology, limited self-driving vehicles have been commercially available in certain leading intelligent transportation system countries. When formulating the car-following model for CAVs, safety is usually the basic constraint. Safety-oriented car-following models seek to specify a safe following distance that can guarantee safe…

    Submitted 28 May, 2023; originally announced May 2023.

  8. arXiv:2305.17183  [pdf]

    q-bio.QM cs.AI eess.IV

    ProGroTrack: Deep Learning-Assisted Tracking of Intracellular Protein Growth Dynamics

    Authors: Kai San Chan, Huimiao Chen, Chenyu **, Yuxuan Tian, Dingchang Lin

    Abstract: Accurate tracking of cellular and subcellular structures, along with their dynamics, plays a pivotal role in understanding the underlying mechanisms of biological systems. This paper presents a novel approach, ProGroTrack, that combines the You Only Look Once (YOLO) and ByteTrack algorithms within the detection-based tracking (DBT) framework to track intracellular protein nanostructures. Focusing…

    Submitted 26 May, 2023; originally announced May 2023.

  9. arXiv:2304.14920  [pdf, other]

    eess.SP cs.AI cs.LG

    An EEG Channel Selection Framework for Driver Drowsiness Detection via Interpretability Guidance

    Authors: Xinliang Zhou, Dan Lin, Ziyu Jia, Chenyu Liu, Liming Zhai, Yang Liu

    Abstract: Drowsy driving has a crucial influence on driving safety, creating an urgent demand for driver drowsiness detection. Electroencephalogram (EEG) signal can accurately reflect the mental fatigue state and thus has been widely studied in drowsiness monitoring. However, the raw EEG data is inherently noisy and redundant, which is neglected by existing works that just use single-channel EEG data or ful…

    Submitted 26 April, 2023; originally announced April 2023.

  10. arXiv:2301.12688  [pdf, other]

    cs.GR cs.CV cs.HC cs.MM eess.IV

    Dynamic Storyboard Generation in an Engine-based Virtual Environment for Video Production

    Authors: Anyi Rao, Xuekun Jiang, Yuwei Guo, Linning Xu, Lei Yang, Libiao **, Dahua Lin, Bo Dai

    Abstract: Amateurs working on mini-films and short-form videos usually spend lots of time and effort on the multi-round complicated process of setting and adjusting scenes, plots, and cameras to deliver satisfying video shots. We present Virtual Dynamic Storyboard (VDS) to allow users to storyboard shots in virtual environments, where the filming staff can easily test the settings of shots before the actual…

    Submitted 21 July, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

    Comments: Project page: https://virtualfilmstudio.github.io/

  11. arXiv:2209.08500  [pdf, other]

    eess.SY cs.LG

    A Map-matching Algorithm with Extraction of Multi-group Information for Low-frequency Data

    Authors: Jie Fang, Xiongwei Wu, Dianchao Lin, Mengyun Xu, Huahua Wu, Xuesong Wu, Ting Bi

    Abstract: The growing use of probe vehicles generates a huge amount of GNSS data. Limited by the satellite positioning technology, further improving the accuracy of map-matching is challenging work, especially for low-frequency trajectories. When matching a trajectory, the ego vehicle's spatial-temporal information of the present trip is the most useful with the least amount of data. In addition, there are…

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: 10 pages, 11 figures, 4 tables

  12. arXiv:2209.04547  [pdf, other]

    cs.CR cs.AI cs.LG cs.SD eess.AS

    Defend Data Poisoning Attacks on Voice Authentication

    Authors: Ke Li, Cameron Baird, Dan Lin

    Abstract: With the advances in deep learning, speaker verification has achieved very high accuracy and is gaining popularity as a type of biometric authentication option in many scenes of our daily life, especially the growing market of web services. Compared to traditional passwords, "vocal passwords" are much more convenient as they relieve people from memorizing different passwords. However, new machine…

    Submitted 7 July, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Journal ref: IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, VOL. 14, NO. 8, AUGUST 2022

  13. arXiv:2206.14940  [pdf, other]

    eess.IV

    Physics-Inspired Unsupervised Classification for Region of Interest in X-Ray Ptychography

    Authors: Dergan Lin, Yi Jiang, Junjing Deng, Zichao Wendy Di

    Abstract: X-ray ptychography allows for large fields to be imaged at high resolution at the cost of additional computational expense due to the large volume of data. Given limited information regarding the object, the acquired data often has an excessive amount of information that is outside the region of interest (RoI). In this work we propose a physics-inspired unsupervised learning algorithm to identify…

    Submitted 29 June, 2022; originally announced June 2022.

  14. arXiv:2204.11669  [pdf]

    eess.IV cs.AI physics.med-ph

    Deep-learning-enabled Brain Hemodynamic Mapping Using Resting-state fMRI

    Authors: Xirui Hou, Pengfei Guo, Puyang Wang, Peiying Liu, Doris D. M. Lin, Hongli Fan, Yang Li, Zhiliang Wei, Zixuan Lin, Dengrong Jiang, ** **, Catherine Kelly, Jay J. Pillai, Judy Huang, Marco C. Pinho, Binu P. Thomas, Babu G. Welch, Denise C. Park, Vishal M. Patel, Argye E. Hillis, Hanzhang Lu

    Abstract: Cerebrovascular disease is a leading cause of death globally. Prevention and early intervention are known to be the most effective forms of its management. Non-invasive imaging methods hold great promise for early stratification, but at present lack the sensitivity for personalized prognosis. Resting-state functional magnetic resonance imaging (rs-fMRI), a powerful tool previously used for mappin…

    Submitted 25 April, 2022; originally announced April 2022.

    Journal ref: npj Digital Medicine (2023) 116

  15. arXiv:2201.02366  [pdf, other]

    cs.CV eess.IV

    Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining

    Authors: Qing Guo, Jingyang Sun, Felix Juefei-Xu, Lei Ma, Di Lin, Wei Feng, Song Wang

    Abstract: Deraining is a significant and fundamental computer vision task, aiming to remove the rain streaks and accumulations in an image or video captured on a rainy day. Existing deraining methods usually make heuristic assumptions of the rain model, which compels them to employ complex optimization or iterative refinement for high recovery quality. This, however, leads to time-consuming methods and a…

    Submitted 7 January, 2022; originally announced January 2022.

    Comments: 14 pages, 10 figures, 10 tables. This is the extension of our conference version https://github.com/tsingqguo/efficientderain

  16. arXiv:2201.00317  [pdf, other]

    eess.IV cs.CV

    Recurrent Feature Propagation and Edge Skip-Connections for Automatic Abdominal Organ Segmentation

    Authors: Zefan Yang, Di Lin, Dong Ni, Yi Wang

    Abstract: Automatic segmentation of abdominal organs in computed tomography (CT) images can support radiation therapy and image-guided surgery workflows. Developing such automatic solutions remains challenging mainly owing to complex organ interactions and blurry boundaries in CT images. To address these issues, we focus on effective spatial context modeling and explicit edge segmentation priors. Accordi…

    Submitted 19 May, 2023; v1 submitted 2 January, 2022; originally announced January 2022.

  17. arXiv:2112.09726  [pdf, other]

    cs.SD cs.CV cs.HC cs.MM eess.AS

    Soundify: Matching Sound Effects to Video

    Authors: David Chuan-En Lin, Anastasis Germanidis, Cristóbal Valenzuela, Yining Shi, Nikolas Martelaro

    Abstract: In the art of video editing, sound helps add character to an object and immerse the viewer within a space. Through formative interviews with professional editors (N=10), we found that the task of adding sounds to video can be challenging. This paper presents Soundify, a system that assists editors in matching sounds to video. Given a video, Soundify identifies matching sounds, synchronizes the sou…

    Submitted 25 June, 2024; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: https://soundify.cc

  18. arXiv:2104.06162  [pdf, other]

    cs.SD cs.CV cs.MM eess.AS

    Visually Informed Binaural Audio Generation without Binaural Audios

    Authors: Xudong Xu, Hang Zhou, Ziwei Liu, Bo Dai, Xiaogang Wang, Dahua Lin

    Abstract: Stereophonic audio, especially binaural audio, plays an essential role in immersive viewing environments. Recent research has explored generating visually guided stereophonic audios supervised by multi-channel audio collections. However, due to the requirement of professional recording devices, existing datasets are limited in scale and variety, which impedes the generalization of supervised metho…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR 2021. Code, models, and demo video are available on our webpage: https://sheldontsui.github.io/projects/PseudoBinaural

  19. arXiv:2012.14830  [pdf]

    cs.LG eess.IV physics.bio-ph physics.med-ph

    A Sparse Model-inspired Deep Thresholding Network for Exponential Signal Reconstruction -- Application in Fast Biological Spectroscopy

    Authors: Zi Wang, Di Guo, Zhangren Tu, Yihui Huang, Yirong Zhou, Jian Wang, Liubin Feng, Donghai Lin, Yongfu You, Tatiana Agback, Vladislav Orekhov, Xiaobo Qu

    Abstract: The non-uniform sampling is a powerful approach to enable fast acquisition but requires sophisticated reconstruction algorithms. Faithful reconstruction from partial sampled exponentials is highly expected in general signal processing and many applications. Deep learning has shown astonishing potential in this field but many existing problems, such as lack of robustness and explainability, greatly…

    Submitted 17 January, 2022; v1 submitted 29 December, 2020; originally announced December 2020.

    Comments: 30 pages

  20. arXiv:2010.10298  [pdf]

    eess.IV cs.CV

    The Detection of Thoracic Abnormalities ChestX-Det10 Challenge Results

    Authors: Jie Lian, Jingyu Liu, Yizhou Yu, Mengyuan Ding, Yaoci Lu, Yi Lu, Jie Cai, Deshou Lin, Miao Zhang, Zhe Wang, Kai He, Yijie Yu

    Abstract: The detection of thoracic abnormalities challenge is organized by the Deepwise AI Lab. The challenge is divided into two rounds. In this paper, we present the results of 6 teams which reach the second round. The challenge adopts the ChestX-Det10 dataset proposed by the Deepwise AI Lab. ChestX-Det10 is the first chest X-Ray dataset with instance-level annotations, including 10 categories of disease…

    Submitted 21 October, 2020; v1 submitted 19 October, 2020; originally announced October 2020.

  21. High Quality Remote Sensing Image Super-Resolution Using Deep Memory Connected Network

    Authors: Wenjia Xu, Guangluan Xu, Yang Wang, Xian Sun, Daoyu Lin, Yirong Wu

    Abstract: Single image super-resolution is an effective way to enhance the spatial resolution of remote sensing image, which is crucial for many applications such as target detection and image classification. However, existing methods based on the neural network usually have small receptive fields and ignore the image detail. We propose a novel method named deep memory connected network (DMCN) based on a co…

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium

  22. arXiv:2008.03548  [pdf, other]

    cs.CV cs.LG cs.MM eess.IV

    A Unified Framework for Shot Type Classification Based on Subject Centric Lens

    Authors: Anyi Rao, Jiaze Wang, Linning Xu, Xuekun Jiang, Qingqiu Huang, Bolei Zhou, Dahua Lin

    Abstract: Shots are key narrative elements of various videos, e.g. movies, TV series, and user-generated videos that are thriving over the Internet. The types of shots greatly influence how the underlying ideas, emotions, and messages are expressed. The technique to analyze shot types is important to the understanding of videos, which has seen increasing demand in real-world applications in this era. Classi…

    Submitted 8 August, 2020; originally announced August 2020.

    Comments: ECCV2020. Project page: https://anyirao.com/projects/ShotType.html

  23. arXiv:2008.03546  [pdf, other]

    cs.CV cs.AI cs.LG cs.MM eess.IV

    Online Multi-modal Person Search in Videos

    Authors: Jiangyue Xia, Anyi Rao, Qingqiu Huang, Linning Xu, Jiangtao Wen, Dahua Lin

    Abstract: The task of searching certain people in videos has seen increasing potential in real-world applications, such as video organization and editing. Most existing approaches are devised to work in an offline manner, where identities can only be inferred after an entire video is examined. This working manner precludes such methods from being applied to online services or those applications that require…

    Submitted 8 August, 2020; originally announced August 2020.

    Comments: ECCV2020. Project page: http://movienet.site/projects/eccv20onlineperson.html

  24. arXiv:2007.09902  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Sep-Stereo: Visually Guided Stereophonic Audio Generation by Associating Source Separation

    Authors: Hang Zhou, Xudong Xu, Dahua Lin, Xiaogang Wang, Ziwei Liu

    Abstract: Stereophonic audio is an indispensable ingredient to enhance human auditory experience. Recent research has explored the usage of visual information as guidance to generate binaural or ambisonic audio from mono ones with stereo supervision. However, this fully supervised paradigm suffers from an inherent drawback: the recording of stereophonic audio usually requires delicate devices that are expen…

    Submitted 20 July, 2020; originally announced July 2020.

    Comments: To appear in Proceedings of the European Conference on Computer Vision (ECCV), 2020. Code, models, and video results are available on our webpage: https://hangz-nju-cuhk.github.io/projects/Sep-Stereo

  25. Comparative Analysis of Economic Instruments in Intersection Operation: A User-Based Perspective

    Authors: DianChao Lin, Saif Eddin Jabari

    Abstract: Focusing on different economic instruments implemented in intersection operations under a connected environment, this paper analyzes their advantages and disadvantages from the travelers' perspective. Travelers' concerns revolve around whether a new instrument is easy to learn and operate, whether it can save time or money, and whether it can reduce the rich-poor gap. After a comparative analysis,…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 6 pages, 8 figures, 6 tables, IEEE-ITSC2020

    Report number: 2020

    Journal ref: The 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

  26. A User-Based Charge and Subsidy Scheme for Single O-D Network Mobility Management

    Authors: Li Li, Dianchao Lin, Saif Eddin Jabari

    Abstract: We propose a path guidance system with a user-based charge and subsidy (UBCS) scheme for single O-D network mobility management. Users who are willing to join the scheme (subscribers) can submit travel requests along with their VOTs to the system before traveling. Those who are not willing to join (outsiders) only need to submit travel requests to the system. Our system will give all users path gu…

    Submitted 15 June, 2020; originally announced June 2020.

    Comments: 6 pages, 3 figures, 2 tables, IEEE ITSC 2020

    Report number: 2020

    Journal ref: The 23rd IEEE International Conference on Intelligent Transportation Systems, 2020

  27. arXiv:2003.13659  [pdf, other]

    eess.IV cs.CV

    Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation

    Authors: Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo

    Abstract: Learning a good image prior is a long-term goal for image restoration and manipulation. While existing methods like deep image prior (DIP) capture low-level image statistics, there are still gaps toward an image prior that captures rich image semantics including color, spatial coherence, textures, and high-level concepts. This work presents an effective way to exploit the image prior captured by a…

    Submitted 20 July, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

    Comments: Accepted to ECCV2020 as oral. 1) Precise GAN-inversion by discriminator-guided generator finetuning. 2) A versatile way for high-quality image restoration and manipulation. Code: https://github.com/XingangPan/deep-generative-prior

  28. arXiv:2002.05512  [pdf, other]

    cs.LG cs.CV eess.IV stat.ML

    Real or Not Real, that is the Question

    Authors: Yuanbo Xiangli, Yubin Deng, Bo Dai, Chen Change Loy, Dahua Lin

    Abstract: While generative adversarial networks (GAN) have been widely adopted in various topics, in this paper we generalize the standard GAN to a new perspective by treating realness as a random variable that can be estimated from multiple angles. In this generalized framework, referred to as RealnessGAN, the discriminator outputs a distribution as the measure of realness. While RealnessGAN shares similar…

    Submitted 12 February, 2020; originally announced February 2020.

    Comments: ICLR2020 spotlight. 1) train GAN by maximizing kl-divergence. 2) train non-progressive GAN (DCGAN) architecture at 1024*1024 resolution

  29. Pay for Intersection Priority: A Free Market Mechanism for Connected Vehicles

    Authors: DianChao Lin, Saif Eddin Jabari

    Abstract: The rapid development and deployment of vehicle technologies offer opportunities to re-think the way traffic is managed. This paper capitalizes on vehicle connectivity and proposes an economic instrument and corresponding cooperative framework for allocating priority at intersections. The framework is compatible with a variety of existing intersection control approaches. Similar to free markets, o…

    Submitted 29 December, 2020; v1 submitted 6 January, 2020; originally announced January 2020.

    Journal ref: IEEE Transactions on Intelligent Transportation Systems, Vol. 23, No. 6, 2022

  30. arXiv:1912.00191  [pdf, other]

    cs.AI cs.RO eess.SP

    Learning a Decision Module by Imitating Driver's Control Behaviors

    Authors: Junning Huang, Sirui Xie, Jiankai Sun, Qiurui Ma, Chunxiao Liu, Jianping Shi, Dahua Lin, Bolei Zhou

    Abstract: Autonomous driving systems have a pipeline of perception, decision, planning, and control. The decision module processes information from the perception module and directs the execution of downstream planning and control modules. On the other hand, the recent success of deep learning suggests that this pipeline could be replaced by end-to-end neural control policies, however, safety cannot be well…

    Submitted 5 May, 2021; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: Proceedings of the Conference on Robot Learning (CoRL) 2020

  31. arXiv:1908.11602  [pdf, other]

    cs.CV cs.SD eess.AS

    Recursive Visual Sound Separation Using Minus-Plus Net

    Authors: Xudong Xu, Bo Dai, Dahua Lin

    Abstract: Sounds provide rich semantics, complementary to visual data, for many tasks. However, in practice, sounds from multiple sources are often mixed together. In this paper we propose a novel framework, referred to as MinusPlus Network (MP-Net), for the task of visual sound separation. MP-Net separates sounds recursively in the order of average energy, removing the separated sound from the mixture at t…

    Submitted 23 October, 2019; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: accepted by ICCV2019

  32. arXiv:1906.07155  [pdf, other]

    cs.CV cs.LG eess.IV

    MMDetection: Open MMLab Detection Toolbox and Benchmark

    Authors: Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin

    Abstract: We present MMDetection, an object detection toolbox that contains a rich set of object detection and instance segmentation methods as well as related components and modules. The toolbox started from a codebase of MMDet team who won the detection track of COCO Challenge 2018. It gradually evolves into a unified platform that covers many popular detection methods and contemporary modules. It not onl…

    Submitted 17 June, 2019; originally announced June 2019.

    Comments: Technical report of MMDetection. 11 pages

  33. Traffic state estimation using stochastic Lagrangian dynamics

    Authors: Fangfang Zheng, Saif Eddin Jabari, Henry X. Liu, DianChao Lin

    Abstract: This paper proposes a new stochastic model of traffic dynamics in Lagrangian coordinates. The source of uncertainty is heterogeneity in driving behavior, captured using driver-specific speed-spacing relations, i.e., parametric uncertainty. It also results in smooth vehicle trajectories in a stochastic context, which is in agreement with real-world traffic dynamics and, thereby, overcoming issues w…

    Submitted 31 May, 2018; originally announced June 2018.

    Journal ref: Transportation Research Part B: Methodological Volume 115, September 2018, Pages 143-165