Skip to main content

Showing 1–15 of 15 results for author: Zou, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.19680  [pdf, other

    cs.CV cs.AI cs.MM

    MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

    Authors: Yuang Zhang, Jiaxi Gu, Li-Wen Wang, Han Wang, Junqi Cheng, Yuefeng Zhu, Fangyuan Zou

    Abstract: In recent years, generative artificial intelligence has achieved significant advancements in the field of image generation, spawning a variety of applications. However, video generation still faces considerable challenges in various aspects, such as controllability, video length, and richness of details, which hinder the application and popularization of this technology. In this work, we propose a… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  2. arXiv:2401.02339  [pdf

    cs.RO

    How Do Pedestrians' Perception Change toward Autonomous Vehicles during Unmarked Midblock Multilane Crossings: Role of AV Operation and Signal Indication

    Authors: Fengjiao Zou, Jennifer Harper Ogle, Patrick Gerard, Weimin **

    Abstract: One of the primary impediments hindering the widespread acceptance of autonomous vehicles (AVs) among pedestrians is their limited comprehension of AVs. This study employs virtual reality (VR) to provide pedestrians with an immersive environment for engaging with and comprehending AVs during unmarked midblock multilane crossings. Diverse AV driving behaviors were modeled to exhibit negotiation beh… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  3. arXiv:2306.04618  [pdf, other

    cs.CL cs.CR cs.LG

    Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations

    Authors: Lifan Yuan, Yangyi Chen, Ganqu Cui, Hongcheng Gao, Fangyuan Zou, Xingyi Cheng, Heng Ji, Zhiyuan Liu, Maosong Sun

    Abstract: This paper reexamines the research on out-of-distribution (OOD) robustness in the field of NLP. We find that the distribution shift settings in previous studies commonly lack adequate challenges, hindering the accurate evaluation of OOD robustness. To address these issues, we propose a benchmark construction protocol that ensures clear differentiation and challenging distribution shifts. Then we i… ▽ More

    Submitted 26 October, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to NeurIPS 2023 Dataset and Benchmark Track. Code is available at \url{https://github.com/lifan-yuan/OOD_NLP}

  4. arXiv:2303.17717  [pdf

    cs.RO

    Pedestrian Behavior Interacting with Autonomous Vehicles during Unmarked Midblock Multilane Crossings: Role of Infrastructure Design, AV Operations and Signaling

    Authors: Fengjiao Zou, Jennifer Ogle, Weimin **, Patrick Gerard, Daniel Petty, Andrew Robb

    Abstract: One of the main challenges autonomous vehicles (AVs) will face is interacting with pedestrians, especially at unmarked midblock locations where the right-of-way is unspecified. This study investigates pedestrian crossing behavior given different roadway centerline features (i.e., undivided, two-way left-turn lane (TWLTL), and median) and various AV operational schemes portrayed to pedestrians thro… ▽ More

    Submitted 30 March, 2023; originally announced March 2023.

  5. arXiv:2303.15352  [pdf

    cs.RO

    Pedestrian Behavior Interacting with Autonomous Vehicles: Role of AV Operation and Signal Indication and Roadway Infrastructure

    Authors: Fengjiao Zou, Jennifer Ogle, Weimin **, Patrick Gerard, Daniel Petty, Andrew Robb

    Abstract: Interacting with pedestrians is challenging for Autonomous vehicles (AVs). This study evaluates how AV operations /associated signaling and roadway infrastructure affect pedestrian behavior in virtual reality. AVs were designed with different operations and signal indications, including negotiating with no signal, negotiating with a yellow signal, and yellow/blue negotiating/no-yield indications.… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  6. arXiv:2303.13032  [pdf

    cs.RO eess.SY

    V2V-based Collision-avoidance Decision Strategy for Autonomous Vehicles Interacting with Fully Occluded Pedestrians at Midblock on Multilane Roadways

    Authors: Fengjiao Zou, Hsien-Wen Deng, Tsing-Un Iunn, Jennifer Harper Ogle, Weimin **

    Abstract: Pedestrian occlusion is challenging for autonomous vehicles (AVs) at midblock locations on multilane roadways because an AV cannot detect crossing pedestrians that are fully occluded by downstream vehicles in adjacent lanes. This paper tests the capability of vehicle-to-vehicle (V2V) communication between an AV and its downstream vehicles to share midblock pedestrian crossings information. The res… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

  7. arXiv:2212.07080  [pdf

    cs.CY

    Reform and Practice of Computer Application Technology Major Construction and Development in Higher Vocational Colleges in China -- Taking Jiangxi Vocational College of Applied Technology as An Example

    Authors: Yufei Xie, Yue Liu, Fan Zou

    Abstract: This study takes the development path of computer application technology specialty construction in Higher Vocational Colleges under the background of high-level higher vocational schools and specialty construction plan with Chinese characteristics (double high plan) as the main research object, and puts forward the core concept of computer application technology specialty construction and developm… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    ACM Class: K.3.2

    Journal ref: International Journal of Higher Education Teaching Theory, Vol.1, No.4, 2020,243-245

  8. R2RNet: Low-light Image Enhancement via Real-low to Real-normal Network

    Authors: Jiang Hai, Zhu Xuan, Songchen Han, Ren Yang, Yutong Hao, Fengzhu Zou, Fang Lin

    Abstract: Images captured in weak illumination conditions could seriously degrade the image quality. Solving a series of degradation of low-light images can effectively improve the visual quality of images and the performance of high-level visual tasks. In this study, a novel Retinex-based Real-low to Real-normal Network (R2RNet) is proposed for low-light image enhancement, which includes three subnets: a D… ▽ More

    Submitted 11 November, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 12 pages, 9 figures

    Journal ref: Journal of Visual Communication and Image Representation, 2022

  9. arXiv:2101.05471  [pdf, other

    cs.LG cs.DC math.OC

    Towards Practical Adam: Non-Convexity, Convergence Theory, and Mini-Batch Acceleration

    Authors: Congliang Chen, Li Shen, Fangyu Zou, Wei Liu

    Abstract: Adam is one of the most influential adaptive stochastic algorithms for training deep neural networks, which has been pointed out to be divergent even in the simple convex setting via a few simple counterexamples. Many attempts, such as decreasing an adaptive learning rate, adopting a big batch size, incorporating a temporal decorrelation technique, seeking an analogous surrogate, \textit{etc.}, ha… ▽ More

    Submitted 8 August, 2022; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: Accepted to JMLR(JMLR). arXiv admin note: substantial text overlap with arXiv:1811.09358

  10. arXiv:2005.13117  [pdf, other

    cs.CV

    SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

    Authors: Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou

    Abstract: Arbitrary text appearance poses a great challenge in scene text recognition tasks. Existing works mostly handle with the problem in consideration of the shape distortion, including perspective distortions, line curvature or other style variations. Therefore, methods based on spatial transformers are extensively studied. However, chromatic difficulties in complex scenes have not been paid much atte… ▽ More

    Submitted 25 October, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: Accepted to AAAI21. Code is available at https://davar-lab.github.io/publication.html or https://github.com/hikopensource/DAVAR-Lab-OCR

  11. arXiv:1908.02422  [pdf, other

    cs.CV

    Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization

    Authors: Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Yi Niu, Shiliang Pu, Fei Wu, Futai Zou

    Abstract: Temporal action localization is an important yet challenging research topic due to its various applications. Since the frame-level or segment-level annotations of untrimmed videos require amounts of labor expenditure, studies on the weakly-supervised action detection have been springing up. However, most of existing frameworks rely on Class Activation Sequence (CAS) to localize actions by minimizi… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: To be appeared in ACM MM2019

  12. arXiv:1904.09290  [pdf, other

    cs.CV cs.AI cs.LG

    FeatherNets: Convolutional Neural Networks as Light as Feather for Face Anti-spoofing

    Authors: Peng Zhang, Fuhao Zou, Zhiwen Wu, Nengli Dai, Skarpness Mark, Michael Fu, Juan Zhao, Kai Li

    Abstract: Face Anti-spoofing gains increased attentions recently in both academic and industrial fields. With the emergence of various CNN based solutions, the multi-modal(RGB, depth and IR) methods based CNN showed better performance than single modal classifiers. However, there is a need for improving the performance and reducing the complexity. Therefore, an extreme light network architecture(FeatherNet… ▽ More

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: 10 pages;6 figures

  13. Vector and Line Quantization for Billion-scale Similarity Search on GPUs

    Authors: Wei Chen, **cai Chen, Fuhao Zou, Yuan-Fang Li, ** Lu, Qiang Wang, Wei Zhao

    Abstract: Billion-scale high-dimensional approximate nearest neighbour (ANN) search has become an important problem for searching similar objects among the vast amount of images and videos available online. The existing ANN methods are usually characterized by their specific indexing structures, including the inverted index and the inverted multi-index structure. The inverted index structure is amenable to… ▽ More

    Submitted 18 April, 2019; v1 submitted 2 January, 2019; originally announced January 2019.

    Comments: Accepted by Future Generation Computer Systems (FGCS)

  14. arXiv:1811.09358  [pdf, ps, other

    cs.LG cs.CV math.NA math.OC stat.ML

    A Sufficient Condition for Convergences of Adam and RMSProp

    Authors: Fangyu Zou, Li Shen, Zequn Jie, Weizhong Zhang, Wei Liu

    Abstract: Adam and RMSProp are two of the most influential adaptive stochastic algorithms for training deep neural networks, which have been pointed out to be divergent even in the convex setting via a few simple counterexamples. Many attempts, such as decreasing an adaptive learning rate, adopting a big batch size, incorporating a temporal decorrelation technique, seeking an analogous surrogate, etc., have… ▽ More

    Submitted 24 June, 2019; v1 submitted 22 November, 2018; originally announced November 2018.

    Comments: Accepted by CVPR2019 as an Oral presentation

  15. arXiv:1808.03408  [pdf, ps, other

    cs.LG math.NA math.OC stat.ML

    A Unified Analysis of AdaGrad with Weighted Aggregation and Momentum Acceleration

    Authors: Li Shen, Congliang Chen, Fangyu Zou, Zequn Jie, Ju Sun, Wei Liu

    Abstract: Integrating adaptive learning rate and momentum techniques into SGD leads to a large class of efficiently accelerated adaptive stochastic algorithms, such as AdaGrad, RMSProp, Adam, AccAdaGrad, \textit{etc}. In spite of their effectiveness in practice, there is still a large gap in their theories of convergences, especially in the difficult non-convex stochastic setting. To fill this gap, we propo… ▽ More

    Submitted 15 May, 2023; v1 submitted 10 August, 2018; originally announced August 2018.

    Comments: IEEE TNNLS