
Showing 1–50 of 67 results for author: Min, W

  1. arXiv:2406.10261  [pdf, other]

    cs.CL cs.AI

    FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

    Authors: Pengfei Zhou, Weiqing Min, Chaoran Fu, Ying Jin, Mingyu Huang, Xiangyang Li, Shuhuan Mei, Shuqiang Jiang

    Abstract: Food is foundational to human life, serving not only as a source of nourishment but also as a cornerstone of cultural identity and social interaction. As the complexity of global dietary needs and preferences grows, food intelligence is needed to enable food perception and reasoning for various tasks, ranging from recipe generation and dietary recommendation to diet-disease correlation discovery a… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 32 pages, 19 figures

  2. arXiv:2405.02207  [pdf]

    physics.chem-ph

    Water Structure and Electric Fields at the Interface of Oil Droplets

    Authors: Lixue Shi, R. Allen LaCour, Xiaoqi Lang, Joseph P. Heindel, Teresa Head-Gordon, Wei Min

    Abstract: Mesoscale water-hydrophobic interfaces are of fundamental importance in multiple disciplines, but their molecular properties have remained elusive for decades due to experimental complications and alternate theoretical explanations. Surface-specific spectroscopies, such as vibrational sum-frequency techniques, suffer from either sample preparation issues or the need for complex spectral correction… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2403.10910  [pdf, other]

    cs.LG

    Graph Regularized NMF with L20-norm for Unsupervised Feature Learning

    Authors: Zhen Wang, Wenwen Min

    Abstract: Nonnegative Matrix Factorization (NMF) is a widely applied technique in the fields of machine learning and data mining. Graph Regularized Non-negative Matrix Factorization (GNMF) is an extension of NMF that incorporates graph regularization constraints. GNMF has demonstrated exceptional performance in clustering and dimensionality reduction, effectively discovering inherent low-dimensional structu… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Submitted to IEEE Trans journal

  4. arXiv:2403.10863  [pdf, other]

    q-bio.GN cs.AI cs.LG

    stMCDI: Masked Conditional Diffusion Model with Graph Neural Network for Spatial Transcriptomics Data Imputation

    Authors: Xiaoyu Li, Wenwen Min, Shunfang Wang, Changmiao Wang, Taosheng Xu

    Abstract: Spatially resolved transcriptomics represents a significant advancement in single-cell analysis by offering both gene expression data and their corresponding physical locations. However, this high degree of spatial resolution entails a drawback, as the resulting spatial transcriptomic data at the cellular level is notably plagued by a high incidence of missing values. Furthermore, most existing im… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Submitted to IJCAI2024

  5. Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection

    Authors: Pengfei Zhou, Weiqing Min, Jiajun Song, Yang Zhang, Shuqiang Jiang

    Abstract: Food computing brings various perspectives to computer vision like vision-based food analysis for nutrition and health. As a fundamental task in food computing, food detection needs Zero-Shot Detection (ZSD) on novel unseen food objects to support real-world scenarios, such as intelligent kitchens and smart restaurants. Therefore, we first benchmark the task of Zero-Shot Food Detection (ZSFD) by i… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: 14 pages, accepted by IEEE Transactions on Image Processing (2024)

  6. arXiv:2312.07473  [pdf]

    physics.chem-ph

    Quantum Mechanical Treatment of Stimulated Raman Cross Sections

    Authors: Wei Min, Xin Gao

    Abstract: Stimulated Raman scattering (SRS) has played an increasingly pivotal role in chemistry and photonics. Recently, understanding of light-molecule interaction during SRS was brought to a new quantitative level through the introduction of stimulated Raman cross section, $σ_{SRS}$. Measurements of Raman-active molecules have revealed interesting insights, and theoretical consideration has suggested an… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 19 pages, 2 figures

  7. arXiv:2311.15328  [pdf, other]

    eess.IV cs.CV

    BS-Diff: Effective Bone Suppression Using Conditional Diffusion Models from Chest X-Ray Images

    Authors: Zhanghao Chen, Yifei Sun, Wenjian Qin, Ruiquan Ge, Cheng Pan, Wenming Deng, Zhou Liu, Wenwen Min, Ahmed Elazab, Xiang Wan, Changmiao Wang

    Abstract: Chest X-rays (CXRs) are commonly utilized as a low-dose modality for lung screening. Nonetheless, the efficacy of CXRs is somewhat impeded, given that approximately 75% of the lung area overlaps with bone, which in turn hampers the detection and diagnosis of diseases. As a remedial measure, bone suppression techniques have been introduced. The current dual-energy subtraction imaging technique in t… ▽ More

    Submitted 28 February, 2024; v1 submitted 26 November, 2023; originally announced November 2023.

    Comments: 5 pages, 2 figures, accepted by IEEE ISBI 2024

  8. arXiv:2311.02400  [pdf, other]

    cs.CY

    From Plate to Production: Artificial Intelligence in Modern Consumer-Driven Food Systems

    Authors: Weiqing Min, Pengfei Zhou, Leyi Xu, Tao Liu, Tianhao Li, Mingyu Huang, Ying Jin, Yifan Yi, Min Wen, Shuqiang Jiang, Ramesh Jain

    Abstract: Global food systems confront the urgent challenge of supplying sustainable, nutritious diets in the face of escalating demands. The advent of Artificial Intelligence (AI) is bringing in a personal choice revolution, wherein AI-driven individual decisions transform food systems from dinner tables, to the farms, and back to our plates. In this context, AI algorithms refine personal dietary choices,… ▽ More

    Submitted 4 November, 2023; originally announced November 2023.

  9. SeeDS: Semantic Separable Diffusion Synthesizer for Zero-shot Food Detection

    Authors: Pengfei Zhou, Weiqing Min, Yang Zhang, Jiajun Song, Ying Jin, Shuqiang Jiang

    Abstract: Food detection is becoming a fundamental task in food computing that supports various multimedia applications, including food recommendation and dietary monitoring. To deal with real-world scenarios, food detection needs to localize and recognize novel food objects that are not seen during training, demanding Zero-Shot Detection (ZSD). However, the complexity of semantic attributes and intra-class… ▽ More

    Submitted 7 October, 2023; originally announced October 2023.

    Comments: Accepted by ACM Multimedia 2023

  10. arXiv:2309.09239  [pdf, other]

    cs.LG stat.ML

    Globally Convergent Accelerated Algorithms for Multilinear Sparse Logistic Regression with $\ell_0$-constraints

    Authors: Weifeng Yang, Wenwen Min

    Abstract: Tensor data represents a multidimensional array. Regression methods based on low-rank tensor decomposition leverage structural information to reduce the parameter count. Multilinear logistic regression serves as a powerful tool for the analysis of multidimensional data. To improve its efficacy and interpretability, we present a Multilinear Sparse Logistic Regression model with $\ell_0$-constraints… ▽ More

    Submitted 17 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2308.12126

  11. arXiv:2308.12126  [pdf, other]

    math.OC cs.LG

    An Accelerated Block Proximal Framework with Adaptive Momentum for Nonconvex and Nonsmooth Optimization

    Authors: Weifeng Yang, Wenwen Min

    Abstract: We propose an accelerated block proximal linear framework with adaptive momentum (ABPL$^+$) for nonconvex and nonsmooth optimization. We analyze the potential causes of the extrapolation step failing in some algorithms, and resolve this issue by enhancing the comparison process that evaluates the trade-off between the proximal gradient step and the linear extrapolation step in our algorithm. Furth… ▽ More

    Submitted 24 August, 2023; v1 submitted 23 August, 2023; originally announced August 2023.

  12. arXiv:2308.06740  [pdf, other]

    cs.LG stat.ML

    Weighted Sparse Partial Least Squares for Joint Sample and Feature Selection

    Authors: Wenwen Min, Taosheng Xu, Chris Ding

    Abstract: Sparse Partial Least Squares (sPLS) is a common dimensionality reduction technique for data fusion, which projects data samples from two views by seeking linear combinations with a small number of variables with the maximum variance. However, sPLS extracts the combinations between two data sets with all data samples so that it cannot detect latent subsets of samples. To extend the application of s… ▽ More

    Submitted 13 August, 2023; originally announced August 2023.

  13. Precise Facial Landmark Detection by Reference Heatmap Transformer

    Authors: Jun Wan, Jun Liu, Jie Zhou, Zhihui Lai, Linlin Shen, Hang Sun, Ping Xiong, Wenwen Min

    Abstract: Most facial landmark detection methods predict landmarks by mapping the input facial appearance features to landmark heatmaps and have achieved promising results. However, when the face image is suffering from large poses, heavy occlusions and complicated illuminations, they cannot learn discriminative feature representations and effective facial shape constraints, nor can they accurately predict… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: Accepted by IEEE Transactions on Image Processing, March 2023

  14. arXiv:2210.04399  [pdf, other]

    cs.CV

    Deep Learning for Logo Detection: A Survey

    Authors: Sujuan Hou, Jiacheng Li, Weiqing Min, Qiang Hou, Yanna Zhao, Yuanjie Zheng, Shuqiang Jiang

    Abstract: When logos are increasingly created, logo detection has gradually become a research hotspot across many domains and tasks. Recent advances in this area are dominated by deep learning-based solutions, where many datasets, learning strategies, network architectures, etc. have been employed. This paper reviews the advance in applying deep learning techniques to logo detection. Firstly, we discuss a c… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

  15. arXiv:2209.14624  [pdf, other]

    cs.LG cs.CV

    Is Complexity Required for Neural Network Pruning? A Case Study on Global Magnitude Pruning

    Authors: Manas Gupta, Efe Camci, Vishandi Rudy Keneta, Abhishek Vaidyanathan, Ritwik Kanodia, Chuan-Sheng Foo, Wu Min, Lin Jie

    Abstract: Pruning neural networks has become popular in the last decade when it was shown that a large number of weights can be safely removed from modern neural networks without compromising accuracy. Numerous pruning methods have been proposed since, each claiming to be better than prior art, however, at the cost of increasingly complex pruning methodologies. These methodologies include utilizing importan… ▽ More

    Submitted 7 January, 2024; v1 submitted 29 September, 2022; originally announced September 2022.

  16. arXiv:2204.10614  [pdf, other]

    cs.LG cs.AI cs.SI

    Modelling graph dynamics in fraud detection with "Attention"

    Authors: Susie Xi Rao, Clémence Lanfranchi, Shuai Zhang, Zhichao Han, Zitao Zhang, Wei Min, Mo Cheng, Yinan Shan, Yang Zhao, Ce Zhang

    Abstract: At online retail platforms, detecting fraudulent accounts and transactions is crucial to improve customer experience, minimize loss, and avoid unauthorized transactions. Despite the variety of different models for deep learning on graphs, few approaches have been proposed for dealing with graphs that are both heterogeneous and dynamic. In this paper, we propose DyHGN (Dynamic Heterogeneous Graph N… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Manuscript under review. arXiv admin note: text overlap with arXiv:2012.10831

  17. arXiv:2203.04910  [pdf, other]

    cs.DC cs.AR cs.OS cs.PF

    GPU-Initiated On-Demand High-Throughput Storage Access in the BaM System Architecture

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Isaac Gelado, Seung Won Min, Amna Masood, Jeongmin Park, Jinjun Xiong, CJ Newburn, Dmitri Vainbrand, I-Hsin Chung, Michael Garland, William Dally, Wen-mei Hwu

    Abstract: Graphics Processing Units (GPUs) have traditionally relied on the host CPU to initiate access to the data storage. This approach is well-suited for GPU applications with known data access patterns that enable partitioning of their dataset to be processed in a pipelined fashion in the GPU. However, emerging applications such as graph and data analytics, recommender systems, or graph neural networks… ▽ More

    Submitted 6 February, 2023; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: This is an extension to the published conference paper at ASPLOS'23: https://dl.acm.org/doi/abs/10.1145/3575693.3575748

    Journal ref: ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2

  18. arXiv:2203.04559  [pdf, other]

    cs.CV

    Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition

    Authors: Yuecong Xu, Jianfei Yang, Haozhi Cao, Keyu Wu, Wu Min, Zhenghua Chen

    Abstract: Video-based Unsupervised Domain Adaptation (VUDA) methods improve the robustness of video models, enabling them to be applied to action recognition tasks across different environments. However, these methods require constant access to source data during the adaptation process. Yet in many real-world applications, subjects and scenes in the source video domain should be irrelevant to those in the t… ▽ More

    Submitted 11 July, 2022; v1 submitted 9 March, 2022; originally announced March 2022.

    Comments: Accepted by ECCV 2022, update to camera-ready version with updated title. 22 pages, 5 figures, 7 tables

  19. arXiv:2111.05894  [pdf, other]

    cs.LG

    Graph Neural Network Training with Data Tiering

    Authors: Seung Won Min, Kun Wu, Mert Hidayetoğlu, Jinjun Xiong, Xiang Song, Wen-mei Hwu

    Abstract: Graph Neural Networks (GNNs) have shown success in learning from graph-structured data, with applications to fraud detection, recommendation, and knowledge graph reasoning. However, training GNN efficiently is challenging because: 1) GPU memory capacity is limited and can be insufficient for large datasets, and 2) the graph-based data structure causes irregular data access patterns. In this work,… ▽ More

    Submitted 10 November, 2021; originally announced November 2021.

  20. arXiv:2108.13775   

    cs.CV

    Discriminative Semantic Feature Pyramid Network with Guided Anchoring for Logo Detection

    Authors: Baisong Zhang, Weiqing Min, Jing Wang, Sujuan Hou, Qiang Hou, Yuanjie Zheng, Shuqiang Jiang

    Abstract: Recently, logo detection has received more and more attention for its wide applications in the multimedia field, such as intellectual property protection, product brand management, and logo duration monitoring. Unlike general object detection, logo detection is a challenging task, especially for small logo objects and large aspect ratio logo objects in the real-world scenario. In this paper, we pr… ▽ More

    Submitted 6 January, 2023; v1 submitted 31 August, 2021; originally announced August 2021.

    Comments: We are very sorry that the result of the whole experiment is wrong because of the wrong derivation of Equation 3, and we would like to withdraw the manuscript to stop the propagation of the mistake

  21. FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network

    Authors: Qiang Hou, Weiqing Min, Jing Wang, Sujuan Hou, Yuanjie Zheng, Shuqiang Jiang

    Abstract: Food logo detection plays an important role in the multimedia for its wide real-world applications, such as food recommendation of the self-service shop and infringement detection on e-commerce platforms. A large-scale food logo dataset is urgently needed for developing advanced food logo detection algorithms. However, there are no available food logo datasets with food brand information. To suppo… ▽ More

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: This paper has been accepted to ACM MM 2021. The FoodLogoDet-1500, see https://github.com/hq03/FoodLogoDet-1500-Dataset

  22. A review on vision-based analysis for automatic dietary assessment

    Authors: Wei Wang, Weiqing Min, Tianhao Li, Xiaoxiao Dong, Haisheng Li, Shuqiang Jiang

    Abstract: Background: Maintaining a healthy diet is vital to avoid health-related issues, e.g., undernutrition, obesity and many non-communicable diseases. An indispensable part of the health diet is dietary assessment. Traditional manual recording methods are not only burdensome but time-consuming, and contain substantial biases and errors. Recent advances in Artificial Intelligence (AI), especially comput… ▽ More

    Submitted 6 March, 2022; v1 submitted 6 August, 2021; originally announced August 2021.

    Comments: Accepted by Trends in Food Science & Technology

  23. Applications of knowledge graphs for food science and industry

    Authors: Weiqing Min, Chunlin Liu, Leyi Xu, Shuqiang Jiang

    Abstract: The deployment of various networks (e.g., Internet of Things [IoT] and mobile networks), databases (e.g., nutrition tables and food compositional databases), and social media (e.g., Instagram and Twitter) generates huge amounts of food data, which present researchers with an unprecedented opportunity to study various problems and applications in food science and industry via data-driven computatio… ▽ More

    Submitted 16 May, 2022; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: 45 pages, 6 figures

    Journal ref: Patterns Volume 3, Issue 5 (2022) 100484

  24. arXiv:2104.13171  [pdf, other]

    cs.LG

    Structured Sparse Non-negative Matrix Factorization with L20-Norm for scRNA-seq Data Analysis

    Authors: Wenwen Min, Taosheng Xu, Xiang Wan, Tsung-Hui Chang

    Abstract: Non-negative matrix factorization (NMF) is a powerful tool for dimensionality reduction and clustering. Unfortunately, the interpretation of the clustering results from NMF is difficult, especially for the high-dimensional biological data without effective feature selection. In this paper, we first introduce a row-sparse NMF with $\ell_{2,0}$-norm constraint (NMF_$\ell_{20}$), where the basis matr… ▽ More

    Submitted 27 April, 2021; originally announced April 2021.

  25. arXiv:2103.16107  [pdf, other]

    cs.CV

    Large Scale Visual Food Recognition

    Authors: Weiqing Min, Zhiling Wang, Yuxin Liu, Mengjiang Luo, Liping Kang, Xiaoming Wei, Xiaolin Wei, Shuqiang Jiang

    Abstract: Food recognition plays an important role in food choice and intake, which is essential to the health and well-being of humans. It is thus of importance to the computer vision community, and can further support many food-oriented vision and multimodal tasks. Unfortunately, we have witnessed remarkable advancements in generic visual recognition for released large-scale datasets, yet largely lags in… ▽ More

    Submitted 26 February, 2023; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence

  26. arXiv:2103.03330  [pdf, other]

    cs.LG

    Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture

    Authors: Seung Won Min, Kun Wu, Sitao Huang, Mert Hidayetoğlu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei Hwu

    Abstract: Graph Convolutional Networks (GCNs) are increasingly adopted in large-scale graph-based recommender systems. Training GCN requires the minibatch generator traversing graphs and sampling the sparsely located neighboring nodes to obtain their features. Since real-world graphs often exceed the capacity of GPU memory, current GCN training systems keep the feature table in host memory and rely on the C… ▽ More

    Submitted 14 August, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: Paper accepted for PVLDB Vol 14

  27. arXiv:2102.04640  [pdf, other]

    cs.IR cs.AI

    Rethinking the Optimization of Average Precision: Only Penalizing Negative Instances before Positive Ones is Enough

    Authors: Zhuo Li, Weiqing Min, Jiajun Song, Yaohui Zhu, Liping Kang, Xiaoming Wei, Xiaolin Wei, Shuqiang Jiang

    Abstract: Optimizing the approximation of Average Precision (AP) has been widely studied for image retrieval. Limited by the definition of AP, such methods consider both negative and positive instances ranking before each positive instance. However, we claim that only penalizing negative instances before positive ones is enough, because the loss only comes from these negative instances. To this end, we prop… ▽ More

    Submitted 7 May, 2022; v1 submitted 8 February, 2021; originally announced February 2021.

  28. arXiv:2101.07956  [pdf, other]

    cs.LG cs.PF

    PyTorch-Direct: Enabling GPU Centric Data Access for Very Large Graph Neural Network Training with Irregular Accesses

    Authors: Seung Won Min, Kun Wu, Sitao Huang, Mert Hidayetoğlu, Jinjun Xiong, Eiman Ebrahimi, Deming Chen, Wen-mei Hwu

    Abstract: With the increasing adoption of graph neural networks (GNNs) in the machine learning community, GPUs have become an essential tool to accelerate GNN training. However, training GNNs on very large graphs that do not fit in GPU memory is still a challenging task. Unlike conventional neural networks, mini-batching input samples in GNNs requires complicated tasks such as traversing neighboring nodes a… ▽ More

    Submitted 19 January, 2021; originally announced January 2021.

  29. arXiv:2101.04285  [pdf, other]

    cs.LG

    Explainable Deep Behavioral Sequence Clustering for Transaction Fraud Detection

    Authors: Wei Min, Weiming Liang, Hang Yin, Zhurong Wang, Mei Li, Alok Lal

    Abstract: In e-commerce industry, user behavior sequence data has been widely used in many business units such as search and merchandising to improve their products. However, it is rarely used in financial services not only due to its 3V characteristics - i.e. Volume, Velocity and Variety - but also due to its unstructured nature. In this paper, we propose a Financial Service scenario Deep learning based Be… ▽ More

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Accepted by AAAI2021 KDF Workshop

  30. arXiv:2012.10831  [pdf, other]

    cs.LG cs.CR cs.SI

    Suspicious Massive Registration Detection via Dynamic Heterogeneous Graph Neural Networks

    Authors: Susie Xi Rao, Shuai Zhang, Zhichao Han, Zitao Zhang, Wei Min, Mo Cheng, Yinan Shan, Yang Zhao, Ce Zhang

    Abstract: Massive account registration has raised concerns on risk management in e-commerce companies, especially when registration increases rapidly within a short time frame. To monitor these registrations constantly and minimize the potential loss they might incur, detecting massive registration and predicting their riskiness are necessary. In this paper, we propose a Dynamic Heterogeneous Graph Neural N… ▽ More

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: 8 pages, 1 figure, accepted in the AAAI Workshop on Deep Learning on Graphs 2021

    ACM Class: I.2.6

  31. arXiv:2011.12193  [pdf, other]

    cs.LG cs.AI cs.SI

    xFraud: Explainable Fraud Transaction Detection

    Authors: Susie Xi Rao, Shuai Zhang, Zhichao Han, Zitao Zhang, Wei Min, Zhiyao Chen, Yinan Shan, Yang Zhao, Ce Zhang

    Abstract: At online retail platforms, it is crucial to actively detect the risks of transactions to improve customer experience and minimize financial loss. In this work, we propose xFraud, an explainable fraud transaction prediction framework which is mainly composed of a detector and an explainer. The xFraud detector can effectively and efficiently predict the legitimacy of incoming transactions. Specific… ▽ More

    Submitted 25 May, 2022; v1 submitted 24 November, 2020; originally announced November 2020.

    Comments: This is the extended version of a full paper to appear in PVLDB 15 (3) (VLDB 2022)

    ACM Class: I.2.6

  32. arXiv:2008.10169  [pdf, other]

    cs.AR cs.DC cs.PF

    Tearing Down the Memory Wall

    Authors: Zaid Qureshi, Vikram Sharma Mailthody, Seung Won Min, I-Hsin Chung, Jinjun Xiong, Wen-mei Hwu

    Abstract: We present a vision for the Erudite architecture that redefines the compute and memory abstractions such that memory bandwidth and capacity become first-class citizens along with compute throughput. In this architecture, we envision coupling a high-density, massively parallel memory technology like Flash with programmable near-data accelerators, like the streaming multiprocessors in modern GPUs. E… ▽ More

    Submitted 23 August, 2020; originally announced August 2020.

    Comments: SRC Techcon 2020 paper. Discusses vision of GPU-Centric architecture, Erudite

  33. arXiv:2008.07960  [pdf, other]

    cs.CV

    Dataset Bias in Few-shot Image Recognition

    Authors: Shuqiang Jiang, Yaohui Zhu, Chenlong Liu, Xinhang Song, Xiangyang Li, Weiqing Min

    Abstract: The goal of few-shot image recognition (FSIR) is to identify novel categories with a small number of annotated samples by exploiting transferable knowledge from training data (base categories). Most current studies assume that the transferable knowledge can be well used to identify novel categories. However, such transferable capability may be impacted by the dataset bias, and this problem has rar… ▽ More

    Submitted 15 March, 2021; v1 submitted 18 August, 2020; originally announced August 2020.

  34. arXiv:2008.05655  [pdf, other]

    cs.CV cs.MM

    ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network

    Authors: Weiqing Min, Linhu Liu, Zhiling Wang, Zhengdong Luo, Xiaoming Wei, Xiaolin Wei, Shuqiang Jiang

    Abstract: Food recognition has received more and more attention in the multimedia community for its various real-world applications, such as diet management and self-service restaurants. A large-scale ontology of food images is urgently needed for developing advanced large-scale food recognition algorithms, as well as for providing the benchmark dataset for such algorithms. To encourage further progress in… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: Accepted by ACM Multimedia 2020

  35. arXiv:2008.05359  [pdf, other]

    cs.CV cs.MM

    LogoDet-3K: A Large-Scale Image Dataset for Logo Detection

    Authors: Jing Wang, Weiqing Min, Sujuan Hou, Shengnan Ma, Yuanjie Zheng, Shuqiang Jiang

    Abstract: Logo detection has been gaining considerable attention because of its wide range of applications in the multimedia field, such as copyright infringement detection, brand visibility monitoring, and product brand management on social media. In this paper, we introduce LogoDet-3K, the largest logo detection dataset with full annotation, which has 3,000 logo categories, about 200,000 manually annotate… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

  36. arXiv:2006.06890  [pdf, other]

    cs.DC cs.DB

    EMOGI: Efficient Memory-access for Out-of-memory Graph-traversal In GPUs

    Authors: Seung Won Min, Vikram Sharma Mailthody, Zaid Qureshi, Jinjun Xiong, Eiman Ebrahimi, Wen-mei Hwu

    Abstract: Modern analytics and recommendation systems are increasingly based on graph data that capture the relations between entities being analyzed. Practical graphs come in huge sizes, offer massive parallelism, and are stored in sparse-matrix formats such as CSR. To exploit the massive parallelism, developers are increasingly interested in using GPUs for graph traversal. However, due to their sizes, gra… ▽ More

    Submitted 14 January, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  37. arXiv:1911.07924  [pdf, other]

    cs.CV

    Logo-2K+: A Large-Scale Logo Dataset for Scalable Logo Classification

    Authors: Jing Wang, Weiqing Min, Sujuan Hou, Shengnan Ma, Yuanjie Zheng, Haishuai Wang, Shuqiang Jiang

    Abstract: Logo classification has gained increasing attention for its various applications, such as copyright infringement detection, product recommendation and contextual advertising. Compared with other types of object images, the real-world logo images have larger variety in logo appearance and more complexity in their background. Therefore, recognizing the logo from images is challenging. To support eff… ▽ More

    Submitted 11 November, 2019; originally announced November 2019.

    Comments: Accepted by AAAI2020

  38. arXiv:1910.04499  [pdf, other]

    cs.LG stat.ML

    DeGNN: Characterizing and Improving Graph Neural Networks with Graph Decomposition

    Authors: Xupeng Miao, Nezihe Merve Gürel, Wentao Zhang, Zhichao Han, Bo Li, Wei Min, Xi Rao, Hansheng Ren, Yinan Shan, Yingxia Shao, Yujie Wang, Fan Wu, Hui Xue, Yaming Yang, Zitao Zhang, Yang Zhao, Shuai Zhang, Yujing Wang, Bin Cui, Ce Zhang

    Abstract: Despite the wide application of Graph Convolutional Network (GCN), one major limitation is that it does not benefit from the increasing depth and suffers from the oversmoothing problem. In this work, we first characterize this phenomenon from the information-theoretic perspective and show that under certain conditions, the mutual information between the output after $l$ layers and the input of GCN… ▽ More

    Submitted 29 June, 2020; v1 submitted 10 October, 2019; originally announced October 2019.

    Comments: 20 pages, 5 figures, 5 tables

  39. arXiv:1908.01261  [pdf, other]

    cs.AR

    Analysis and Optimization of I/O Cache Coherency Strategies for SoC-FPGA Device

    Authors: Seung Won Min, Sitao Huang, Mohamed El-Hadedy, Jinjun Xiong, Deming Chen, Wen-mei Hwu

    Abstract: Unlike traditional PCIe-based FPGA accelerators, heterogeneous SoC-FPGA devices provide tighter integrations between software running on CPUs and hardware accelerators. Modern heterogeneous SoC-FPGA platforms support multiple I/O cache coherence options between CPUs and FPGAs, but these options can have inadvertent effects on the achieved bandwidths depending on applications and data access patter… ▽ More

    Submitted 3 August, 2019; originally announced August 2019.

  40. arXiv:1905.06269  [pdf, other]

    cs.CY cs.IR cs.MM

    Food Recommendation: Framework, Existing Solutions and Challenges

    Authors: Weiqing Min, Shuqiang Jiang, Ramesh Jain

    Abstract: A growing proportion of the global population is becoming overweight or obese, leading to various diseases (e.g., diabetes, ischemic heart disease and even cancer) due to unhealthy eating patterns, such as increased intake of food with high energy and high fat. Food recommendation is of paramount importance to alleviate this problem. Unfortunately, modern multimedia research has enhanced the perfo… ▽ More

    Submitted 19 November, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: Accepted by IEEE Transactions on Multimedia

  41. arXiv:1810.09833  [pdf, other]

    cs.MM

    Hierarchy-Dependent Cross-Platform Multi-View Feature Learning for Venue Category Prediction

    Authors: Shuqiang Jiang, Weiqing Min, Shuhuan Mei

    Abstract: In this work, we focus on visual venue category prediction, which can facilitate various applications for location-based service and personalization. Considering that the complementarity of different media platforms, it is reasonable to leverage venue-relevant media data from different platforms to boost the prediction performance. Intuitively, recognizing one venue category involves multiple sema… ▽ More

    Submitted 23 October, 2018; originally announced October 2018.

    Comments: Accepted by IEEE Transactions on Multimedia

  42. arXiv:1808.07202  [pdf, other

    cs.CY cs.MM

    A Survey on Food Computing

    Authors: Weiqing Min, Shuqiang Jiang, Linhu Liu, Yong Rui, Ramesh Jain

    Abstract: Food is very essential for human life and it is fundamental to the human experience. Food-related study may support multifarious applications and services, such as guiding the human behavior, improving the human health and understanding the culinary culture. With the rapid development of social networks, mobile networks, and Internet of Things (IoT), people commonly upload, share, and record food…

    Submitted 16 July, 2019; v1 submitted 21 August, 2018; originally announced August 2018.

    Comments: Accepted by ACM Computing Surveys

  43. arXiv:1808.05329  [pdf, other

    cs.LG cs.IR stat.ML

    Sequential Behavioral Data Processing Using Deep Learning and the Markov Transition Field in Online Fraud Detection

    Authors: Ruinan Zhang, Fanglan Zheng, Wei Min

    Abstract: Due to the popularity of the Internet and smart mobile devices, more and more financial transactions and activities have been digitalized. Compared to traditional financial fraud detection strategies using credit-related features, customers are generating a large amount of unstructured behavioral data every second. In this paper, we propose a Recurrent Neural Network (RNN) based deep-learning str…

    Submitted 15 August, 2018; originally announced August 2018.

    Comments: KDD2018 Data Science in Fintech Workshop Paper

  44. arXiv:1807.10956  [pdf, other

    stat.ML cs.LG q-bio.QM

    Group-sparse SVD Models and Their Applications in Biological Data

    Authors: Wenwen Min, Juan Liu, Shihua Zhang

    Abstract: Sparse Singular Value Decomposition (SVD) models have been proposed for biclustering high dimensional gene expression data to identify block patterns with similar expressions. However, these models do not take into account prior group effects upon variable selection. To this end, we first propose group-sparse SVD models with group Lasso (GL1-SVD) and group L0-norm penalty (GL0-SVD) for non-overlap…

    Submitted 28 July, 2018; originally announced July 2018.

    Comments: 14 pages, 4 figures

  45. arXiv:1801.07239  [pdf, other

    cs.CV cs.MM

    Food recognition and recipe analysis: integrating visual content, context and external knowledge

    Authors: Luis Herranz, Weiqing Min, Shuqiang Jiang

    Abstract: The central role of food in our individual and social life, combined with recent technological advances, has motivated a growing interest in applications that help to better monitor dietary habits as well as the exploration and retrieval of food-related information. We review how visual content, context and external knowledge can be integrated effectively into food-oriented applications, with spec…

    Submitted 22 January, 2018; originally announced January 2018.

    Comments: Survey about contextual food recognition and multimodal recipe analysis

  46. arXiv:1710.04792  [pdf, other

    cs.LG stat.ML

    Sparse Weighted Canonical Correlation Analysis

    Authors: Wenwen Min, Juan Liu, Shihua Zhang

    Abstract: Given two data matrices $X$ and $Y$, sparse canonical correlation analysis (SCCA) is to seek two sparse canonical vectors $u$ and $v$ to maximize the correlation between $Xu$ and $Yv$. However, classical and sparse CCA models consider the contribution of all the samples of data matrices and thus cannot identify an underlying specific subset of samples. To this end, we propose a novel sparse weight…

    Submitted 12 October, 2017; originally announced October 2017.

    Comments: 8 pages, 5 figures

    ACM Class: I.5.1; H.2.8; G.1.6

  47. arXiv:1609.06480  [pdf, ps, other

    q-bio.GN cs.LG stat.ML

    Network-regularized Sparse Logistic Regression Models for Clinical Risk Prediction and Biomarker Discovery

    Authors: Wenwen Min, Juan Liu, Shihua Zhang

    Abstract: Molecular profiling data (e.g., gene expression) has been used for clinical risk prediction and biomarker discovery. However, it is necessary to integrate other prior knowledge like biological pathways or gene interaction networks to improve the predictive ability and biological interpretability of biomarkers. Here, we first introduce a general regularized Logistic Regression (LR) framework with r…

    Submitted 21 September, 2016; originally announced September 2016.

    Comments: 10 pages, 3 figures

    ACM Class: J.3; H.2.8; G.1.6; I.5

  48. arXiv:1603.06035  [pdf, other

    cs.LG stat.ML

    L0-norm Sparse Graph-regularized SVD for Biclustering

    Authors: Wenwen Min, Juan Liu, Shihua Zhang

    Abstract: Learning the "blocking" structure is a central challenge for high dimensional data (e.g., gene expression data). Recently, a sparse singular value decomposition (SVD) has been used as a biclustering tool to achieve this goal. However, this model ignores the structural information between variables (e.g., gene interaction graph). Although typical graph-regularized norm can incorporate such prior gr…

    Submitted 18 March, 2016; originally announced March 2016.

    Comments: 8 pages, 2 figures

    ACM Class: I.5.1, I.5.3, H.2.8

  49. arXiv:1511.09368  [pdf, ps, other

    cs.SI physics.soc-ph

    A neurodynamic framework for local community extraction in networks

    Authors: Shihua Zhang, Guanghua Hu, Wenwen Min

    Abstract: To understand the structure and organization of a large-scale social, biological or technological network, it can be helpful to describe and extract local communities or modules of the network. In this article, we develop a neurodynamic framework to describe the local communities which correspond to the stable states of a neuro-system built based on the network. The quantitative criteria to descri…

    Submitted 30 November, 2015; originally announced November 2015.

    Comments: 4 figures

  50. arXiv:1507.05696  [pdf, ps, other

    astro-ph.HE astro-ph.IM

    Testing and Performance of UFFO Burst Alert & Trigger Telescope

    Authors: J. Ripa, M. B. Kim, J. Lee, I. H. Park, J. E. Kim, H. Lim, S. Jeong, A. J. Castro-Tirado, P. H. Connell, C. Eyles, V. Reglero, J. M. Rodrigo, V. Bogomolov, M. I. Panasyuk, V. Petrov, S. Svertilov, I. Yashin, S. Brandt, C. Budtz-Jorgensen, Y.-Y. Chang, P. Chen, M. A. Huang, T.-C. Liu, J. W. Nam, M.-Z. Wang, et al. (4 additional authors not shown)

    Abstract: The Ultra-Fast Flash Observatory pathfinder (UFFO-p) is a new space mission dedicated to detecting Gamma-Ray Bursts (GRBs) and rapidly following their afterglows in order to provide early optical/ultraviolet measurements. A GRB location is determined in a few seconds by the UFFO Burst Alert & Trigger telescope (UBAT) employing the coded mask imaging technique and the detector combination of Yttrium Oxyo…

    Submitted 20 July, 2015; originally announced July 2015.

    Comments: journal: Proceedings of Science, Swift: 10 Years of Discovery; conference date: 2-5 December 2014; location: La Sapienza University, Rome, Italy; 7 pages, 4 figures; accepted for publication on July 9, 2015

    Journal ref: Proceedings of Science 233 (2015) 102