-
EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy
Authors:
Long Bai,
Tong Chen,
Qiaozhi Tan,
Wan Jun Nah,
Yanheng Li,
Zhicheng He,
Sishen Yuan,
Zhen Chen,
**lin Wu,
Mobarakol Islam,
Zhen Li,
Hongbin Liu,
Hongliang Ren
Abstract:
Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels rema…
▽ More
Wireless Capsule Endoscopy (WCE) is highly valued for its non-invasive and painless approach, though its effectiveness is compromised by uneven illumination from hardware constraints and complex internal dynamics, leading to overexposed or underexposed images. While researchers have discussed the challenges of low-light enhancement in WCE, the issue of correcting for different exposure levels remains underexplored. To tackle this, we introduce EndoUIC, a WCE unified illumination correction solution using an end-to-end promptable diffusion transformer (DiT) model. In our work, the illumination prompt module shall navigate the model to adapt to different exposure levels and perform targeted image enhancement, in which the Adaptive Prompt Integration (API) and Global Prompt Scanner (GPS) modules shall further boost the concurrent representation learning between the prompt parameters and features. Besides, the U-shaped restoration DiT model shall capture the long-range dependencies and contextual information for unified illumination restoration. Moreover, we present a novel Capsule-endoscopy Exposure Correction (CEC) dataset, including ground-truth and corrupted image pairs annotated by expert photographers. Extensive experiments against a variety of state-of-the-art (SOTA) methods on four datasets showcase the effectiveness of our proposed method and components in WCE illumination restoration, and the additional downstream experiments further demonstrate its utility for clinical diagnosis and surgical assistance.
△ Less
Submitted 8 July, 2024; v1 submitted 19 June, 2024;
originally announced June 2024.
-
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Authors:
Guankun Wang,
Long Bai,
Wan Jun Nah,
Jie Wang,
Zhaoxi Zhang,
Zhen Chen,
**lin Wu,
Mobarakol Islam,
Hongbin Liu,
Hongliang Ren
Abstract:
Recent advancements in Surgical Visual Question Answering (Surgical-VQA) and related region grounding have shown great promise for robotic and medical applications, addressing the critical need for automated methods in personalized surgical mentorship. However, existing models primarily provide simple structured answers and struggle with complex scenarios due to their limited capability in recogni…
▽ More
Recent advancements in Surgical Visual Question Answering (Surgical-VQA) and related region grounding have shown great promise for robotic and medical applications, addressing the critical need for automated methods in personalized surgical mentorship. However, existing models primarily provide simple structured answers and struggle with complex scenarios due to their limited capability in recognizing long-range dependencies and aligning multimodal information. In this paper, we introduce Surgical-LVLM, a novel personalized large vision-language model tailored for complex surgical scenarios. Leveraging the pre-trained large vision-language model and specialized Visual Perception LoRA (VP-LoRA) blocks, our model excels in understanding complex visual-language tasks within surgical contexts. In addressing the visual grounding task, we propose the Token-Interaction (TIT) module, which strengthens the interaction between the grounding module and the language responses of the Large Visual Language Model (LVLM) after projecting them into the latent space. We demonstrate the effectiveness of Surgical-LVLM on several benchmarks, including EndoVis-17-VQLA, EndoVis-18-VQLA, and a newly introduced EndoVis Conversations dataset, which sets new performance standards. Our work contributes to advancing the field of automated surgical mentorship by providing a context-aware solution.
△ Less
Submitted 22 March, 2024;
originally announced May 2024.
-
Text in the Dark: Extremely Low-Light Text Image Enhancement
Authors:
Che-Tsung Lin,
Chun Chet Ng,
Zhi Qin Tan,
Wan Jun Nah,
Xinyu Wang,
Jie Long Kew,
Pohao Hsu,
Shang Hong Lai,
Chee Seng Chan,
Christopher Zach
Abstract:
Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text t…
▽ More
Extremely low-light text images are common in natural scenes, making scene text detection and recognition challenging. One solution is to enhance these images using low-light image enhancement methods before text extraction. However, previous methods often do not try to particularly address the significance of low-level features, which are crucial for optimal performance on downstream scene text tasks. Further research is also hindered by the lack of extremely low-light text datasets. To address these limitations, we propose a novel encoder-decoder framework with an edge-aware attention module to focus on scene text regions during enhancement. Our proposed method uses novel text detection and edge reconstruction losses to emphasize low-level scene text features, leading to successful text extraction. Additionally, we present a Supervised Deep Curve Estimation (Supervised-DCE) model to synthesize extremely low-light images based on publicly available scene text datasets such as ICDAR15 (IC15). We also labeled texts in the extremely low-light See In the Dark (SID) and ordinary LOw-Light (LOL) datasets to allow for objective assessment of extremely low-light image enhancement through scene text tasks. Extensive experiments show that our model outperforms state-of-the-art methods in terms of both image quality and scene text metrics on the widely-used LOL, SID, and synthetic IC15 datasets. Code and dataset will be released publicly at https://github.com/chunchet-ng/Text-in-the-Dark.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
Wireless Avionics Intra-Communications: A Survey of Benefits, Challenges, and Solutions
Authors:
Pangun Park,
Piergiuseppe Di Marco,
Junghyo Nah,
Carlo Fischione
Abstract:
In the aeronautics industry, wireless avionics intra-communications have a tremendous potential to improve efficiency and flexibility while reducing the weight, fuel consumption, and maintenance costs over traditional wired avionics systems. This survey starts with an overview of the major benefits and opportunities in the deployment of wireless technologies for critical applications of an aircraf…
▽ More
In the aeronautics industry, wireless avionics intra-communications have a tremendous potential to improve efficiency and flexibility while reducing the weight, fuel consumption, and maintenance costs over traditional wired avionics systems. This survey starts with an overview of the major benefits and opportunities in the deployment of wireless technologies for critical applications of an aircraft. The current state-of-art is presented in terms of system classifications based on data rate demands and transceiver installation locations. We then discuss major technical challenges in the design and realization of the envisioned aircraft applications. Although wireless avionics intra-communication has aspects and requirements similar to mission-critical applications of industrial automation, it also has specific issues such as complex structures, operations, and safety of the aircraft that make this area of research self-standing and challenging. To support the critical operations of an aircraft, existing wireless standards for mission-critical industrial applications are briefly discussed to investigate the applicability of the current solutions. Specifically, IEEE 802.15.4-based protocols and Bluetooth are discussed for low data rate applications, whereas IEEE 802.11- based standards are considered for high data rate applications. Eventually, we propose fundamental schemes in terms of network architecture, protocol, and resource management to support the critical avionics applications and discuss the research directions in this emerging area.
△ Less
Submitted 22 June, 2020;
originally announced June 2020.
-
Quantum Size Effects on the Chemical Sensing Performance of Two-Dimensional Semiconductors
Authors:
Junghyo Nah,
S. Bala Kumar,
Hui Fang,
Yu-Ze Chen,
Elena Plis,
Yu-Lun Chueh,
Sanjay Krishna,
**g Guo,
Ali Javey
Abstract:
We investigate the role of quantum confinement on the performance of gas sensors based on two-dimensional InAs membranes. Pd-decorated InAs membranes configured as H2 sensors are shown to exhibit strong thickness dependence, with ~100x enhancement in the sensor response as the thickness is reduced from 48 to 8 nm. Through detailed experiments and modeling, the thickness scaling trend is attributed…
▽ More
We investigate the role of quantum confinement on the performance of gas sensors based on two-dimensional InAs membranes. Pd-decorated InAs membranes configured as H2 sensors are shown to exhibit strong thickness dependence, with ~100x enhancement in the sensor response as the thickness is reduced from 48 to 8 nm. Through detailed experiments and modeling, the thickness scaling trend is attributed to the quantization of electrons which favorably alters both the position and the transport properties of charge carriers; thus making them more susceptible to surface phenomena.
△ Less
Submitted 19 April, 2012;
originally announced April 2012.
-
Coulomb Drag of Massless Fermions in Graphene
Authors:
Seyoung Kim,
Insun Jo,
Junghyo Nah,
Z. Yao,
S. K. Banerjee,
E. Tutuc
Abstract:
Using a novel structure, consisting of two, independently contacted graphene single layers separated by an ultra-thin dielectric, we experimentally measure the Coulomb drag of massless fermions in graphene. At temperatures higher than 50 K, the Coulomb drag follows a temperature and carrier density dependence consistent with the Fermi liquid regime. As the temperature is reduced, the Coulomb drag…
▽ More
Using a novel structure, consisting of two, independently contacted graphene single layers separated by an ultra-thin dielectric, we experimentally measure the Coulomb drag of massless fermions in graphene. At temperatures higher than 50 K, the Coulomb drag follows a temperature and carrier density dependence consistent with the Fermi liquid regime. As the temperature is reduced, the Coulomb drag exhibits giant fluctuations with an increasing amplitude, thanks to the interplay between coherent transport in the graphene layer and interaction between the two layers.
△ Less
Submitted 6 April, 2011; v1 submitted 11 October, 2010;
originally announced October 2010.
-
Lateral Spin Injection in Germanium Nanowires
Authors:
En-Shao Liu,
Junghyo Nah,
Kamran M. Varahramyan,
Emanuel Tutuc
Abstract:
Electrical injection of spin-polarized electrons into a semiconductor, large spin diffusion length, and an integration friendly platform are desirable ingredients for spin-based devices. Here we demonstrate lateral spin injection and detection in germanium nanowires, by using ferromagnetic metal contacts and tunnel barriers for contact resistance engineering. Using data measured from over 80 sampl…
▽ More
Electrical injection of spin-polarized electrons into a semiconductor, large spin diffusion length, and an integration friendly platform are desirable ingredients for spin-based devices. Here we demonstrate lateral spin injection and detection in germanium nanowires, by using ferromagnetic metal contacts and tunnel barriers for contact resistance engineering. Using data measured from over 80 samples, we map out the contact resistance window for which lateral spin transport is observed, manifestly showing the conductivity matching required for spin injection. Our analysis, based on the spin diffusion theory, indicates that the spin diffusion length is larger than 100 μm in germanium nanowires at 4.2 K.
△ Less
Submitted 24 August, 2010; v1 submitted 19 March, 2010;
originally announced March 2010.
-
Scaling Properties of Ge-SixGe1-x Core-Shell Nanowire Field Effect Transistors
Authors:
Junghyo Nah,
En-Shao Liu,
Kamran M. Varahramyan,
Davood Shahrjerdi,
Sanjay K. Banerjee,
Emanuel Tutuc
Abstract:
We demonstrate the fabrication of high-performance Ge-SixGe1-x core-shell nanowire field-effect transistors with highly doped source and drain, and systematically investigate their scaling properties. Highly doped source and drain regions are realized by low energy boron implantation, which enables efficient carrier injection with a contact resistance much lower than the nanowire resistance. We…
▽ More
We demonstrate the fabrication of high-performance Ge-SixGe1-x core-shell nanowire field-effect transistors with highly doped source and drain, and systematically investigate their scaling properties. Highly doped source and drain regions are realized by low energy boron implantation, which enables efficient carrier injection with a contact resistance much lower than the nanowire resistance. We extract key device parameters, such as intrinsic channel resistance, carrier mobility, effective channel length, and external contact resistance, as well as benchmark the device switching speed and ON/OFF current ratio.
△ Less
Submitted 9 December, 2009;
originally announced December 2009.
-
Large-Area Synthesis of High-Quality and Uniform Graphene Films on Copper Foils
Authors:
Xuesong Li,
Weiwei Cai,
**ho An,
Seyoung Kim,
Junghyo Nah,
Dongxing Yang,
Richard Piner,
Aruna Velamakanni,
Inhwa Jung,
Emanuel Tutuc,
Sanjay K. Banerjee,
Luigi Colombo,
Rodney S. Ruoff
Abstract:
Graphene has been attracting great interest because of its distinctive band structure and physical properties. Today, graphene is limited to small sizes because it is produced mostly by exfoliating graphite. We grew large-area graphene films of the order of centimeters on copper substrates by chemical vapor deposition using methane. The films are predominantly single layer graphene with a small…
▽ More
Graphene has been attracting great interest because of its distinctive band structure and physical properties. Today, graphene is limited to small sizes because it is produced mostly by exfoliating graphite. We grew large-area graphene films of the order of centimeters on copper substrates by chemical vapor deposition using methane. The films are predominantly single layer graphene with a small percentage (less than 5%) of the area having few layers, and are continuous across copper surface steps and grain boundaries. The low solubility of carbon in copper appears to help make this growth process self-limiting. We also developed graphene film transfer processes to arbitrary substrates, and dual-gated field-effect transistors fabricated on Si/SiO2 substrates showed electron mobilities as high as 4050 cm2V-1s-1 at room temperature.
△ Less
Submitted 13 May, 2009; v1 submitted 11 May, 2009;
originally announced May 2009.
-
Realization of a High Mobility Dual-gated Graphene Field Effect Transistor with Al2O3 Dielectric
Authors:
Seyoung Kim,
Junghyo Nah,
Insun Jo,
Davood Shahrjerdi,
Luigi Colombo,
Zhen Yao,
Emanuel Tutuc,
Sanjay K. Banerjee
Abstract:
We fabricate and characterize dual-gated graphene field-effect transistors (FETs) using Al2O3 as top-gate dielectric. We use a thin Al film as a nucleation layer to enable the atomic layer deposition of Al2O3. Our devices show mobility values of over 8,000 cm2/Vs at room temperature, a finding which indicates that the top-gate stack does not significantly increase the carrier scattering, and con…
▽ More
We fabricate and characterize dual-gated graphene field-effect transistors (FETs) using Al2O3 as top-gate dielectric. We use a thin Al film as a nucleation layer to enable the atomic layer deposition of Al2O3. Our devices show mobility values of over 8,000 cm2/Vs at room temperature, a finding which indicates that the top-gate stack does not significantly increase the carrier scattering, and consequently degrade the device characteristics. We propose a device model to fit the experimental data using a single mobility value.
△ Less
Submitted 11 September, 2009; v1 submitted 19 January, 2009;
originally announced January 2009.