-
Adaptable Logical Control for Large Language Models
Authors:
Honghua Zhang,
Po-Nien Kung,
Masahiro Yoshida,
Guy Van den Broeck,
Nanyun Peng
Abstract:
Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Mark…
▽ More
Despite the success of Large Language Models (LLMs) on various tasks following human instructions, controlling model generation at inference time poses a persistent challenge. In this paper, we introduce Ctrl-G, an adaptable framework that facilitates tractable and flexible control of LLM generation to reliably follow logical constraints. Ctrl-G combines any production-ready LLM with a Hidden Markov Model, enabling LLM outputs to adhere to logical constraints represented as deterministic finite automata. We show that Ctrl-G, when applied to a TULU2-7B model, outperforms GPT3.5 and GPT4 on the task of interactive text editing: specifically, for the task of generating text insertions/continuations following logical constraints, Ctrl-G achieves over 30% higher satisfaction rate in human evaluation compared to GPT4. When applied to medium-size language models (e.g., GPT2-large), Ctrl-G also beats its counterparts for constrained generation by large margins on standard benchmarks. Additionally, as a proof-of-concept study, we experiment Ctrl-G on the Grade School Math benchmark to assist LLM reasoning, foreshadowing the application of Ctrl-G, as well as other constrained generation approaches, beyond traditional language generation tasks.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling
Authors:
Hritik Bansal,
Po-Nien Kung,
P. Jeffrey Brantingham,
Kai-Wei Chang,
Nanyun Peng
Abstract:
Multimodal event argument role labeling (EARL), a task that assigns a role for each event participant (object) in an image is a complex challenge. It requires reasoning over the entire image, the depicted event, and the interactions between various objects participating in the event. Existing models heavily rely on high-quality event-annotated training data to understand the event semantics and st…
▽ More
Multimodal event argument role labeling (EARL), a task that assigns a role for each event participant (object) in an image is a complex challenge. It requires reasoning over the entire image, the depicted event, and the interactions between various objects participating in the event. Existing models heavily rely on high-quality event-annotated training data to understand the event semantics and structures, and they fail to generalize to new event types and domains. In this paper, we propose GenEARL, a training-free generative framework that harness the power of the modern generative models to understand event task descriptions given image contexts to perform the EARL task. Specifically, GenEARL comprises two stages of generative prompting with a frozen vision-language model (VLM) and a frozen large language model (LLM). First, a generative VLM learns the semantics of the event argument roles and generates event-centric object descriptions based on the image. Subsequently, a LLM is prompted with the generated object descriptions with a predefined template for EARL (i.e., assign an object with an event argument role). We show that GenEARL outperforms the contrastive pretraining (CLIP) baseline by 9.4% and 14.2% accuracy for zero-shot EARL on the M2E2 and SwiG datasets, respectively. In addition, we outperform CLIP-Event by 22% precision on M2E2 dataset. The framework also allows flexible adaptation and generalization to unseen domains.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
Improving Event Definition Following For Zero-Shot Event Detection
Authors:
Zefan Cai,
Po-Nien Kung,
Ashima Suvarna,
Mingyu Derek Ma,
Hritik Bansal,
Baobao Chang,
P. Jeffrey Brantingham,
Wei Wang,
Nanyun Peng
Abstract:
Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types, and prompt them with unseen event definitions. These approaches yield sporadic successes, yet generally fall short of expectations. In this work, we aim to improve zero-shot event detection by training models to better follow event definitions. We hypothesize that a diverse set of ev…
▽ More
Existing approaches on zero-shot event detection usually train models on datasets annotated with known event types, and prompt them with unseen event definitions. These approaches yield sporadic successes, yet generally fall short of expectations. In this work, we aim to improve zero-shot event detection by training models to better follow event definitions. We hypothesize that a diverse set of event types and definitions are the key for models to learn to follow event definitions while existing event extraction datasets focus on annotating many high-quality examples for a few event types. To verify our hypothesis, we construct an automatically generated Diverse Event Definition (DivED) dataset and conduct comparative studies. Our experiments reveal that a large number of event types (200) and diverse event definitions can significantly boost event extraction performance; on the other hand, the performance does not scale with over ten examples per event type. Beyond scaling, we incorporate event ontology information and hard-negative samples during training, further boosting the performance. Based on these findings, we fine-tuned a LLaMA-2-7B model on our DivED dataset, yielding performance that surpasses SOTA large language models like GPT-3.5 across three open benchmarks on zero-shot event detection.
△ Less
Submitted 4 March, 2024;
originally announced March 2024.
-
Active Instruction Tuning: Improving Cross-Task Generalization by Training on Prompt Sensitive Tasks
Authors:
Po-Nien Kung,
Fan Yin,
Di Wu,
Kai-Wei Chang,
Nanyun Peng
Abstract:
Instruction tuning (IT) achieves impressive zero-shot generalization results by training large language models (LLMs) on a massive amount of diverse tasks with instructions. However, how to select new tasks to improve the performance and generalizability of IT models remains an open question. Training on all existing tasks is impractical due to prohibiting computation requirements, and randomly se…
▽ More
Instruction tuning (IT) achieves impressive zero-shot generalization results by training large language models (LLMs) on a massive amount of diverse tasks with instructions. However, how to select new tasks to improve the performance and generalizability of IT models remains an open question. Training on all existing tasks is impractical due to prohibiting computation requirements, and randomly selecting tasks can lead to suboptimal performance. In this work, we propose active instruction tuning based on prompt uncertainty, a novel framework to identify informative tasks, and then actively tune the models on the selected tasks. We represent the informativeness of new tasks with the disagreement of the current model outputs over perturbed prompts. Our experiments on NIV2 and Self-Instruct datasets demonstrate that our method consistently outperforms other baseline strategies for task selection, achieving better out-of-distribution generalization with fewer training tasks. Additionally, we introduce a task map that categorizes and diagnoses tasks based on prompt uncertainty and prediction probability. We discover that training on ambiguous (prompt-uncertain) tasks improves generalization while training on difficult (prompt-certain and low-probability) tasks offers no benefit, underscoring the importance of task selection for instruction tuning.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
MIDDAG: Where Does Our News Go? Investigating Information Diffusion via Community-Level Information Pathways
Authors:
Mingyu Derek Ma,
Alexander K. Taylor,
Nuan Wen,
Yanchen Liu,
Po-Nien Kung,
Wenna Qin,
Shicheng Wen,
Azure Zhou,
Diyi Yang,
Xuezhe Ma,
Nanyun Peng,
Wei Wang
Abstract:
We present MIDDAG, an intuitive, interactive system that visualizes the information propagation paths on social media triggered by COVID-19-related news articles accompanied by comprehensive insights, including user/community susceptibility level, as well as events and popular opinions raised by the crowd while propagating the information. Besides discovering information flow patterns among users,…
▽ More
We present MIDDAG, an intuitive, interactive system that visualizes the information propagation paths on social media triggered by COVID-19-related news articles accompanied by comprehensive insights, including user/community susceptibility level, as well as events and popular opinions raised by the crowd while propagating the information. Besides discovering information flow patterns among users, we construct communities among users and develop the propagation forecasting capability, enabling tracing and understanding of how information is disseminated at a higher level.
△ Less
Submitted 20 February, 2024; v1 submitted 3 October, 2023;
originally announced October 2023.
-
LONER: LiDAR Only Neural Representations for Real-Time SLAM
Authors:
Seth Isaacson,
Pou-Chun Kung,
Mani Ramanagopal,
Ram Vasudevan,
Katherine A. Skinner
Abstract:
This paper proposes LONER, the first real-time LiDAR SLAM algorithm that uses a neural implicit scene representation. Existing implicit map** methods for LiDAR show promising results in large-scale reconstruction, but either require groundtruth poses or run slower than real-time. In contrast, LONER uses LiDAR data to train an MLP to estimate a dense map in real-time, while simultaneously estimat…
▽ More
This paper proposes LONER, the first real-time LiDAR SLAM algorithm that uses a neural implicit scene representation. Existing implicit map** methods for LiDAR show promising results in large-scale reconstruction, but either require groundtruth poses or run slower than real-time. In contrast, LONER uses LiDAR data to train an MLP to estimate a dense map in real-time, while simultaneously estimating the trajectory of the sensor. To achieve real-time performance, this paper proposes a novel information-theoretic loss function that accounts for the fact that different regions of the map may be learned to varying degrees throughout online training. The proposed method is evaluated qualitatively and quantitatively on two open-source datasets. This evaluation illustrates that the proposed loss function converges faster and leads to more accurate geometry reconstruction than other loss functions used in depth-supervised neural implicit frameworks. Finally, this paper shows that LONER estimates trajectories competitively with state-of-the-art LiDAR SLAM methods, while also producing dense maps competitive with existing real-time implicit map** methods that use groundtruth poses.
△ Less
Submitted 23 March, 2024; v1 submitted 10 September, 2023;
originally announced September 2023.
-
STAR: Boosting Low-Resource Information Extraction by Structure-to-Text Data Generation with Large Language Models
Authors:
Mingyu Derek Ma,
Xiaoxuan Wang,
Po-Nien Kung,
P. Jeffrey Brantingham,
Nanyun Peng,
Wei Wang
Abstract:
Information extraction tasks such as event extraction require an in-depth understanding of the output structure and sub-task dependencies. They heavily rely on task-specific training data in the form of (passage, target structure) pairs to obtain reasonable performance. However, obtaining such data through human annotation is costly, leading to a pressing need for low-resource information extracti…
▽ More
Information extraction tasks such as event extraction require an in-depth understanding of the output structure and sub-task dependencies. They heavily rely on task-specific training data in the form of (passage, target structure) pairs to obtain reasonable performance. However, obtaining such data through human annotation is costly, leading to a pressing need for low-resource information extraction approaches that require minimal human labeling for real-world applications. Fine-tuning supervised models with synthesized training data would be a generalizable method, but the existing data generation methods either still rely on large-scale ground-truth data or cannot be applied to complicated IE tasks due to their poor performance. To address these challenges, we propose STAR, a data generation method that leverages Large Language Models (LLMs) to synthesize data instances given limited seed demonstrations, thereby boosting low-resource information extraction performance. Our approach involves generating target structures (Y) followed by generating passages (X), all accomplished with the aid of LLMs. We design fine-grained step-by-step instructions to obtain the initial data instances. We further reduce errors and improve data quality through self-reflection error identification and self-refinement with iterative revision. Our experiments show that the data generated by STAR significantly improve the performance of low-resource event extraction and relation extraction tasks, even surpassing the effectiveness of human-curated data. Human assessment of the data quality shows STAR-generated data exhibits higher passage quality and better align with the task definitions compared with the human-curated data.
△ Less
Submitted 20 February, 2024; v1 submitted 24 May, 2023;
originally announced May 2023.
-
Do Models Really Learn to Follow Instructions? An Empirical Study of Instruction Tuning
Authors:
Po-Nien Kung,
Nanyun Peng
Abstract:
Recent works on instruction tuning (IT) have achieved great performance with zero-shot generalizability to unseen tasks. With additional context (e.g., task definition, examples) provided to models for fine-tuning, they achieved much higher performance than untuned models. Despite impressive performance gains, what models learn from IT remains understudied. In this work, we analyze how models util…
▽ More
Recent works on instruction tuning (IT) have achieved great performance with zero-shot generalizability to unseen tasks. With additional context (e.g., task definition, examples) provided to models for fine-tuning, they achieved much higher performance than untuned models. Despite impressive performance gains, what models learn from IT remains understudied. In this work, we analyze how models utilize instructions during IT by comparing model training with altered vs. original instructions. Specifically, we create simplified task definitions by removing all semantic components and only leaving the output space information, and delusive examples that contain incorrect input-output map**. Our experiments show that models trained on simplified task definition or delusive examples can achieve comparable performance to the ones trained on the original instructions and examples. Furthermore, we introduce a random baseline to perform zeroshot classification tasks, and find it achieves similar performance (42.6% exact-match) as IT does (43% exact-match) in low resource setting, while both methods outperform naive T5 significantly (30% per exact-match). Our analysis provides evidence that the impressive performance gain of current IT models can come from picking up superficial patterns, such as learning the output format and guessing. Our study highlights the urgent need for more reliable IT methods and evaluation.
△ Less
Submitted 25 May, 2023; v1 submitted 18 May, 2023;
originally announced May 2023.
-
Radar Occupancy Prediction with Lidar Supervision while Preserving Long-Range Sensing and Penetrating Capabilities
Authors:
Pou-Chun Kung,
Chieh-Chih Wang,
Wen-Chieh Lin
Abstract:
Radar shows great potential for autonomous driving by accomplishing long-range sensing under diverse weather conditions. But radar is also a particularly challenging sensing modality due to the radar noises. Recent works have made enormous progress in classifying free and occupied spaces in radar images by leveraging lidar label supervision. However, there are still several unsolved issues. Firstl…
▽ More
Radar shows great potential for autonomous driving by accomplishing long-range sensing under diverse weather conditions. But radar is also a particularly challenging sensing modality due to the radar noises. Recent works have made enormous progress in classifying free and occupied spaces in radar images by leveraging lidar label supervision. However, there are still several unsolved issues. Firstly, the sensing distance of the results is limited by the sensing range of lidar. Secondly, the performance of the results is degenerated by lidar due to the physical sensing discrepancies between the two sensors. For example, some objects visible to lidar are invisible to radar, and some objects occluded in lidar scans are visible in radar images because of the radar's penetrating capability. These sensing differences cause false positive and penetrating capability degeneration, respectively.
In this paper, we propose training data preprocessing and polar sliding window inference to solve the issues. The data preprocessing aims to reduce the effect caused by radar-invisible measurements in lidar scans. The polar sliding window inference aims to solve the limited sensing range issue by applying a near-range trained network to the long-range region. Instead of using common Cartesian representation, we propose to use polar representation to reduce the shape dissimilarity between long-range and near-range data. We find that extending a near-range trained network to long-range region inference in the polar space has 4.2 times better IoU than in Cartesian space. Besides, the polar sliding window inference can preserve the radar penetrating capability by changing the viewpoint of the inference region, which makes some occluded measurements seem non-occluded for a pretrained network.
△ Less
Submitted 17 January, 2022; v1 submitted 8 December, 2021;
originally announced December 2021.
-
Nanoscale Raman Characterization of a 2D Semiconductor Lateral Heterostructure Interface
Authors:
Sourav Garg,
J. Pierce Fix,
Andrey V. Krayev,
Connor Flanery,
Michael Colgrove,
Audrey R. Sulkanen,
Minyuan Wang,
Gang-Yu Liu,
Nicholas J. Borys,
Patrick Kung
Abstract:
The nature of the interface in lateral heterostructures of 2D monolayer semiconductors including its composition, size, and heterogeneity critically impacts the functionalities it engenders on the 2D system for next-generation optoelectronics. Here, we use tip-enhanced Raman scattering (TERS) to characterize the interface in a single-layer MoS2/WS2 lateral heterostructure with a spatial resolution…
▽ More
The nature of the interface in lateral heterostructures of 2D monolayer semiconductors including its composition, size, and heterogeneity critically impacts the functionalities it engenders on the 2D system for next-generation optoelectronics. Here, we use tip-enhanced Raman scattering (TERS) to characterize the interface in a single-layer MoS2/WS2 lateral heterostructure with a spatial resolution of 50 nm. Resonant and non-resonant TERS spectroscopies reveal that the interface is alloyed with a size that varies over an order of magnitude-from 50-600 nm-within a single crystallite. Nanoscale imaging of the continuous interfacial evolution of the resonant and non-resonant Raman spectra enables the deconvolution of defect-activation, resonant enhancement, and material composition for several vibrational modes in single-layer MoS2, MoxW1-xS2, and WS2. The results demonstrate the capabilities of nanoscale TERS spectroscopy to elucidate macroscopic structure-property relationships in 2D materials and to characterize lateral interfaces of 2D systems on length scales that are imperative for devices.
△ Less
Submitted 29 October, 2021;
originally announced November 2021.
-
Multi-Task Learning for Situated Multi-Domain End-to-End Dialogue Systems
Authors:
Po-Nien Kung,
Chung-Cheng Chang,
Tse-Hsuan Yang,
Hsin-Kai Hsu,
Yu-Jia Liou,
Yun-Nung Chen
Abstract:
Task-oriented dialogue systems have been a promising area in the NLP field. Previous work showed the effectiveness of using a single GPT-2 based model to predict belief states and responses via causal language modeling. In this paper, we leverage multi-task learning techniques to train a GPT-2 based model on a more challenging dataset with multiple domains, multiple modalities, and more diversity…
▽ More
Task-oriented dialogue systems have been a promising area in the NLP field. Previous work showed the effectiveness of using a single GPT-2 based model to predict belief states and responses via causal language modeling. In this paper, we leverage multi-task learning techniques to train a GPT-2 based model on a more challenging dataset with multiple domains, multiple modalities, and more diversity in output formats.
Using only a single model, our method achieves better performance on all sub-tasks, across domains, compared to task and domain-specific models. Furthermore, we evaluated several proposed strategies for GPT-2 based dialogue systems with comprehensive ablation studies, showing that all techniques can further improve the performance.
△ Less
Submitted 11 October, 2021;
originally announced October 2021.
-
Inconsequential results on the Merino-Welsh conjecture for Tutte polynomials
Authors:
Joseph P. S. Kung
Abstract:
The Merino-Welsh conjectures say that subject to conditions, there is an inequality among the Tutte-polynomial evaluations $T(M;2,0)$, $T(M;0,2)$, and $T(M;1,1)$. We present three results on a Merino-Welsh conjecture. These results are "inconsequential" in the sense that although they imply a version of the conjecture for many matroids, they seem to be dead ends.
The Merino-Welsh conjectures say that subject to conditions, there is an inequality among the Tutte-polynomial evaluations $T(M;2,0)$, $T(M;0,2)$, and $T(M;1,1)$. We present three results on a Merino-Welsh conjecture. These results are "inconsequential" in the sense that although they imply a version of the conjecture for many matroids, they seem to be dead ends.
△ Less
Submitted 19 May, 2021; v1 submitted 4 May, 2021;
originally announced May 2021.
-
The $\barγ$-frame for Tutte polynomials of matroids
Authors:
Joseph P. S. Kung
Abstract:
Specializing the $γ$-basis for the vector space $\mathcal{G}(n,r)$ spanned by the set of symbols on bit sequences with $r$ $1$'s and $n-r$ $0$'s, we obtain a frame or spanning set for the vector space $\mathcal{T}(n,r)$ spanned by Tutte polynomials of matroids having rank $r$ and size $n$. Every Tutte polynomial can be expanded as a linear combination with non-negative integer coefficients of elem…
▽ More
Specializing the $γ$-basis for the vector space $\mathcal{G}(n,r)$ spanned by the set of symbols on bit sequences with $r$ $1$'s and $n-r$ $0$'s, we obtain a frame or spanning set for the vector space $\mathcal{T}(n,r)$ spanned by Tutte polynomials of matroids having rank $r$ and size $n$. Every Tutte polynomial can be expanded as a linear combination with non-negative integer coefficients of elements in this frame. We give explicit formulas for the elements in this frame. These formulas combine to give an expansion of the Tutte polynomial with coefficients obtained by summing numerical invariants over all flats with a given rank and size.
△ Less
Submitted 5 June, 2021; v1 submitted 8 April, 2021;
originally announced April 2021.
-
A Normal Distribution Transform-Based Radar Odometry Designed For Scanning and Automotive Radars
Authors:
Pou-Chun Kung,
Chieh-Chih Wang,
Wen-Chieh Lin
Abstract:
Existing radar sensors can be classified into automotive and scanning radars. While most radar odometry (RO) methods are only designed for a specific type of radar, our RO method adapts to both scanning and automotive radars. Our RO is simple yet effective, where the pipeline consists of thresholding, probabilistic submap building, and an NDT-based radar scan matching. The proposed RO has been tes…
▽ More
Existing radar sensors can be classified into automotive and scanning radars. While most radar odometry (RO) methods are only designed for a specific type of radar, our RO method adapts to both scanning and automotive radars. Our RO is simple yet effective, where the pipeline consists of thresholding, probabilistic submap building, and an NDT-based radar scan matching. The proposed RO has been tested on two public radar datasets: the Oxford Radar RobotCar dataset and the nuScenes dataset, which provide scanning and automotive radar data respectively. The results show that our approach surpasses state-of-the-art RO using either automotive or scanning radar by reducing translational error by 51% and 30%, respectively, and rotational error by 17% and 29%, respectively. Besides, we show that our RO achieves centimeter-level accuracy as lidar odometry, and automotive and scanning RO have similar accuracy.
△ Less
Submitted 30 March, 2023; v1 submitted 14 March, 2021;
originally announced March 2021.
-
Algebra and valuations related to the Tutte polynomial
Authors:
Michael J. Falk,
Joseph P. S. Kung
Abstract:
This is a chapter destined for the book "Handbook of the Tutte Polynomial". The chapter is a composite. The first part is a brief introduction to Orlik-Solomon algebras. The second part sketches the theory of evaluative functions on matroid base polytopes and in particular, the G-invariant (as the subject is known late 2015). A third very short section is on Hopf-algebra or coalgebra structures in…
▽ More
This is a chapter destined for the book "Handbook of the Tutte Polynomial". The chapter is a composite. The first part is a brief introduction to Orlik-Solomon algebras. The second part sketches the theory of evaluative functions on matroid base polytopes and in particular, the G-invariant (as the subject is known late 2015). A third very short section is on Hopf-algebra or coalgebra structures in Tutte polynomial theory.
△ Less
Submitted 23 November, 2017;
originally announced November 2017.
-
Deep Poisson Factorization Machines: factor analysis for map** behaviors in journalist ecosystem
Authors:
Pau Perng-Hwa Kung
Abstract:
Newsroom in online ecosystem is difficult to untangle. With prevalence of social media, interactions between journalists and individuals become visible, but lack of understanding to inner processing of information feedback loop in public sphere leave most journalists baffled. Can we provide an organized view to characterize journalist behaviors on individual level to know better of the ecosystem?…
▽ More
Newsroom in online ecosystem is difficult to untangle. With prevalence of social media, interactions between journalists and individuals become visible, but lack of understanding to inner processing of information feedback loop in public sphere leave most journalists baffled. Can we provide an organized view to characterize journalist behaviors on individual level to know better of the ecosystem? To this end, I propose Poisson Factorization Machine (PFM), a Bayesian analogue to matrix factorization that assumes Poisson distribution for generative process. The model generalizes recent studies on Poisson Matrix Factorization to account temporal interaction which involves tensor-like structure, and label information. Two inference procedures are designed, one based on batch variational EM and another stochastic variational inference scheme that efficiently scales with data size. An important novelty in this note is that I show how to stack layers of PFM to introduce a deep architecture. This work discusses some potential results applying the model and explains how such latent factors may be useful for analyzing latent behaviors for data exploration.
△ Less
Submitted 29 December, 2017; v1 submitted 17 December, 2015;
originally announced December 2015.
-
Measuring Responsiveness in the Online Public Sphere for the 2016 U.S. Election: Concepts
Authors:
Pau Perng-Hwa Kung,
Deb Roy
Abstract:
The election narrative is formed under the competitions of ideas among critical players involving politicians, news media, public influentials, and the general public. Untangling the complex process of narrative formation, however, is no easy task due to implicit influences among the key players. This paper outlines a conceptual framework to untangle this complex process. We propose the problem of…
▽ More
The election narrative is formed under the competitions of ideas among critical players involving politicians, news media, public influentials, and the general public. Untangling the complex process of narrative formation, however, is no easy task due to implicit influences among the key players. This paper outlines a conceptual framework to untangle this complex process. We propose the problem of measuring "responsiveness" that quantifies a player's influence on another given a specific election topic over time. In particular, we make use of multivariate Hawkes Process to infer the influence network between pairs of election players. We demonstrate an early version system of analytic pipeline of online public sphere discussions from data ingestion, influence inference, to visualization. The paper concludes by showcasing some preliminary results based on Twitter and news media election-related contents from July to October 2015 and discussing plans for future research.
△ Less
Submitted 9 December, 2015; v1 submitted 18 November, 2015;
originally announced November 2015.
-
The $\mathcal{G}$-invariant and catenary data of a matroid
Authors:
Joseph E. Bonin,
Joseph P. S. Kung
Abstract:
The catenary data of a matroid $M$ of rank $r$ on $n$ elements is the vector $(ν(M;a_0,a_1,\ldots,a_r))$, indexed by compositions $(a_0,a_1,\ldots,a_r)$, where $a_0 \geq 0$,\, $a_i > 0$ for $i \geq 1$, and $a_0+ a_1 + \cdots + a_r = n$, with the coordinate $ν(M;a_0,a_1, \ldots,a_r)$ equal to the number of maximal chains or flags $(X_0,X_1, \ldots,X_r)$ of flats or closed sets such that $X_i$ has r…
▽ More
The catenary data of a matroid $M$ of rank $r$ on $n$ elements is the vector $(ν(M;a_0,a_1,\ldots,a_r))$, indexed by compositions $(a_0,a_1,\ldots,a_r)$, where $a_0 \geq 0$,\, $a_i > 0$ for $i \geq 1$, and $a_0+ a_1 + \cdots + a_r = n$, with the coordinate $ν(M;a_0,a_1, \ldots,a_r)$ equal to the number of maximal chains or flags $(X_0,X_1, \ldots,X_r)$ of flats or closed sets such that $X_i$ has rank $i$,\, $|X_0| = a_0$, and $|X_i - X_{i-1}| = a_i$. We show that the catenary data of $M$ contains the same information about $M$ as its $\mathcal{G}$-invariant, which was defined by H. Derksen [\emph{J.\ Algebr.\ Combin.}\ 30 (2009) 43--86]. The Tutte polynomial is a specialization of the $\mathcal{G}$-invariant. We show that many known results for the Tutte polynomial have analogs for the $\mathcal{G}$-invariant. In particular, we show that for many matroid constructions, the $\mathcal{G}$-invariant of the construction can be calculated from the $\mathcal{G}$-invariants of the constituents and that the $\mathcal{G}$-invariant of a matroid can be calculated from its size, the isomorphism class of the lattice of cyclic flats with lattice elements labeled by the rank and size of the underlying set. We also show that the number of flats and cyclic flats of a given rank and size can be derived from the $\mathcal{G}$-invariant, that the $\mathcal{G}$-invariant of $M$ is reconstructible from the deck of $\mathcal{G}$-invariants of restrictions of $M$ to its copoints, and that, apart from free extensions and coextensions, one can detect whether a matroid is a free product from its $\mathcal{G}$-invariant.
△ Less
Submitted 17 January, 2017; v1 submitted 2 October, 2015;
originally announced October 2015.
-
Maximum size binary matroids with no AG(3,2)-minor are graphic
Authors:
Joseph P. S. Kung,
Dillon Mayhew,
Irene Pivotto,
Gordon F. Royle
Abstract:
We prove that the maximum size of a simple binary matroid of rank $r \geq 5$ with no AG(3,2)-minor is $\binom{r+1}{2}$ and characterise those matroids achieving this bound. When $r \geq 6$, the graphic matroid $M(K_{r+1})$ is the unique matroid meeting the bound, but there are a handful of smaller examples. In addition, we determine the size function for non-regular simple binary matroids with no…
▽ More
We prove that the maximum size of a simple binary matroid of rank $r \geq 5$ with no AG(3,2)-minor is $\binom{r+1}{2}$ and characterise those matroids achieving this bound. When $r \geq 6$, the graphic matroid $M(K_{r+1})$ is the unique matroid meeting the bound, but there are a handful of smaller examples. In addition, we determine the size function for non-regular simple binary matroids with no AG(3,2)-minor and characterise the matroids of maximum size for each rank.
△ Less
Submitted 8 April, 2013;
originally announced April 2013.
-
Growth Morphology of Boron Doped Single Crystal Diamond
Authors:
S. K. Karna,
Y. K. Vohra,
P. Kung,
S. T. Weir
Abstract:
Boron-doped single crystal diamond films were grown homoepitaxially on synthetic (100) Type Ib diamond substrates using microwave plasma assisted chemical vapor deposition. A modification in surface morphology of the film with increasing boron concentration in the plasma has been observed using atomic force microscopy. Use of nitrogen during boron do** has been found to improve the surface morph…
▽ More
Boron-doped single crystal diamond films were grown homoepitaxially on synthetic (100) Type Ib diamond substrates using microwave plasma assisted chemical vapor deposition. A modification in surface morphology of the film with increasing boron concentration in the plasma has been observed using atomic force microscopy. Use of nitrogen during boron do** has been found to improve the surface morphology and the growth rate of films but it lowers the electrical conductivity of the film. The Raman spectra indicated a zone center optical phonon mode along with a few additional bands at the lower wavenumber regions. The change in the peak profile of the zone center optical phonon mode and its downshift were observed with the increasing boron content in the film. However, shrinkage and upshift of Raman line was observed in the film that was grown in presence of nitrogen along with diborane in process gas.
△ Less
Submitted 5 November, 2013; v1 submitted 31 January, 2013;
originally announced January 2013.
-
Semidirect sums of matroids
Authors:
Joseph E. Bonin,
Joseph P. S. Kung
Abstract:
For matroids M and N on disjoint sets S and T, a semidirect sum of M and N is a matroid K on the union of S and T that, like the direct sum and the free product, has the restriction of K to S equal to M and the contraction of K to T equal to N. We abstract a matrix construction to get a general matroid construction: the matroid union of any rank-preserving extension of M on the union of S and T wi…
▽ More
For matroids M and N on disjoint sets S and T, a semidirect sum of M and N is a matroid K on the union of S and T that, like the direct sum and the free product, has the restriction of K to S equal to M and the contraction of K to T equal to N. We abstract a matrix construction to get a general matroid construction: the matroid union of any rank-preserving extension of M on the union of S and T with the direct sum of N and the rank-0 matroid on S is a semidirect sum of M and N. We study principal sums in depth; these are such matroid unions where the extension of M has each element of T added either as a loop or freely on a fixed flat of M. A second construction of semidirect sums, defined by a Higgs lift, also specializes to principal sums. We also explore what can be deduced if M and N, or certain of their semidirect sums, are transversal or fundamental transversal matroids.
△ Less
Submitted 1 October, 2012;
originally announced October 2012.
-
Rook and queen paths with boundaries
Authors:
Joseph P. S. Kung,
Anna de Mier
Abstract:
A rook path is a path on lattice points in the plane in which any proper horizontal step to the right or vertical step north is allowed. If, in addition, one allow bishop steps, that is, proper diagonal steps of slope 1, then one has queen paths. A rook or queen path is Catalan if it starts at the origin and stays strictly to the left of the line y = x-1. We give explicit formulas for the ordinary…
▽ More
A rook path is a path on lattice points in the plane in which any proper horizontal step to the right or vertical step north is allowed. If, in addition, one allow bishop steps, that is, proper diagonal steps of slope 1, then one has queen paths. A rook or queen path is Catalan if it starts at the origin and stays strictly to the left of the line y = x-1. We give explicit formulas for the ordinary generating function of the number of Catalan rook and queen paths finishing at $(n,n).$ These generating functions are algebraic; indeed, they satisfy quadratic equations. In the second version, we also consider paths with "spider steps", that is, proper steps on lattice points with slope strictly greater than one. In the third version, we give step-enumerator versions of our results.
△ Less
Submitted 3 July, 2012; v1 submitted 8 September, 2011;
originally announced September 2011.
-
Characterizations of transversal and fundamental transversal matroids
Authors:
Joseph E. Bonin,
Joseph P. S. Kung,
Anna de Mier
Abstract:
A result of Mason, as refined by Ingleton, characterizes transversal matroids as the matroids that satisfy a set of inequalities that relate the ranks of intersections and unions of nonempty sets of cyclic flats. We prove counterparts, for fundamental transversal matroids, of this and other characterizations of transversal matroids. In particular, we show that fundamental transversal matroids are…
▽ More
A result of Mason, as refined by Ingleton, characterizes transversal matroids as the matroids that satisfy a set of inequalities that relate the ranks of intersections and unions of nonempty sets of cyclic flats. We prove counterparts, for fundamental transversal matroids, of this and other characterizations of transversal matroids. In particular, we show that fundamental transversal matroids are precisely the matroids that yield equality in Mason's inequalities and we deduce a characterization of fundamental transversal matroids due to Brylawski from this simpler characterization.
△ Less
Submitted 17 September, 2010;
originally announced September 2010.
-
Congruence conditions, parcels, and Tutte polynomials of graphs and matroids
Authors:
Joseph P. S. Kung
Abstract:
Let $G$ be a matrix and $M(G)$ be the matroid defined by linear dependence on the set $E$ of column vectors of $G.$ Roughly speaking, a parcel is a subset of pairs $(f,g)$ of functions defined on $E$ to an Abelian group $A$ satisfying a coboundary condition (that $f-g$ is a flow over $A$ relative to $G$) and a congruence condition (that the size of the supports of $f$ and $g$ satisfy some congruen…
▽ More
Let $G$ be a matrix and $M(G)$ be the matroid defined by linear dependence on the set $E$ of column vectors of $G.$ Roughly speaking, a parcel is a subset of pairs $(f,g)$ of functions defined on $E$ to an Abelian group $A$ satisfying a coboundary condition (that $f-g$ is a flow over $A$ relative to $G$) and a congruence condition (that the size of the supports of $f$ and $g$ satisfy some congruence condition modulo an integer). We prove several theorems of the form: a linear combination of sizes of parcels, with coefficients roots of unity, equals an evaluation of the Tutte polynomial of $M(G)$ at a point $(λ-1,x-1)$ on the complex hyperbola $(λ- 1)(x-1) = |A|.$
△ Less
Submitted 3 December, 2011; v1 submitted 1 July, 2010;
originally announced July 2010.
-
Convolution-multiplication identities for Tutte polynomials of matroids
Authors:
Joseph P. S. Kung
Abstract:
We give a general multiplication-convolution identity for the multivariate and bivariate rank generating polynomial of a matroid. The bivariate rank generating polynomial is transformable to and from the Tutte polynomial by simple algebraic operations. Several identities, almost all already known in some form, are specializations of this identity. Combinatorial or probabilistic interpretations a…
▽ More
We give a general multiplication-convolution identity for the multivariate and bivariate rank generating polynomial of a matroid. The bivariate rank generating polynomial is transformable to and from the Tutte polynomial by simple algebraic operations. Several identities, almost all already known in some form, are specializations of this identity. Combinatorial or probabilistic interpretations are given for the specialized identities.
△ Less
Submitted 11 September, 2009;
originally announced September 2009.
-
Graphs whose flow polynomials have only integral roots
Authors:
Joseph P. S. Kung,
Gordon F. Royle
Abstract:
We show if the flow polynomial of a bridgeless graph G has only integral roots, then G is the dual graph to a planar chordal graph. We also show that for 3-connected cubic graphs, the same conclusion holds under the weaker hypothesis that it has only real flow roots. Expressed in the language of matroid theory, this result says that the cographic matroids with only integral characteristic roots…
▽ More
We show if the flow polynomial of a bridgeless graph G has only integral roots, then G is the dual graph to a planar chordal graph. We also show that for 3-connected cubic graphs, the same conclusion holds under the weaker hypothesis that it has only real flow roots. Expressed in the language of matroid theory, this result says that the cographic matroids with only integral characteristic roots are the cycle matroids of planar chordal graphs.
△ Less
Submitted 10 September, 2009; v1 submitted 2 August, 2009;
originally announced August 2009.
-
Lattice and Schroder paths with periodic boundaries
Authors:
Joseph P. S. Kung,
Anna de Mier,
Xinyu Sun,
Catherine H. Yan
Abstract:
We consider paths in the plane with $(1,0),$ $(0,1),$ and $(a,b)$-steps that start at the origin, end at height $n,$ and stay to the left of a given non-decreasing right boundary. We show that if the boundary is periodic and has slope at most $b/a,$ then the ordinary generating function for the number of such paths ending at height $n$ is algebraic. Our argument is in two parts. We use a simple…
▽ More
We consider paths in the plane with $(1,0),$ $(0,1),$ and $(a,b)$-steps that start at the origin, end at height $n,$ and stay to the left of a given non-decreasing right boundary. We show that if the boundary is periodic and has slope at most $b/a,$ then the ordinary generating function for the number of such paths ending at height $n$ is algebraic. Our argument is in two parts. We use a simple combinatorial decomposition to obtain an Appell relation or ``umbral'' generating function, in which the power $z^n$ is replaced by a power series of the form $z^n φ_n(z),$ where $φ_n(0) = 1.$ Then we convert (in an explicit way) the umbral generating function to an ordinary generating function by solving a system of linear equations and a polynomial equation. This conversion implies that the ordinary generating function is algebraic.
△ Less
Submitted 27 September, 2007; v1 submitted 11 September, 2007;
originally announced September 2007.
-
Derivation modules of orthogonal duals of hyperplane arrangements
Authors:
Joseph P. S. Kung,
Hal Schenck
Abstract:
Let A be an n by d matrix having full rank n. An orthogonal dual A^{\perp} of A is a (d-n) by d matrix of rank (d-n) such that every row of A^{\perp} is orthogonal (under the usual dot product) to every row of A. We define the orthogonal dual for arrangements by identifying an essential (central) arrangement of d hyperplanes in n-dimensional space with the n by d matrix of coefficients of the ho…
▽ More
Let A be an n by d matrix having full rank n. An orthogonal dual A^{\perp} of A is a (d-n) by d matrix of rank (d-n) such that every row of A^{\perp} is orthogonal (under the usual dot product) to every row of A. We define the orthogonal dual for arrangements by identifying an essential (central) arrangement of d hyperplanes in n-dimensional space with the n by d matrix of coefficients of the homogeneous linear forms for which the hyperplanes are kernels. If n is at least 5, we show that if the matroid (or the intersection lattice) of an n-dimensional essential arrangement A contains a modular copoint whose complement spans, then the derivation module of the orthogonally dual arrangement \A^{\perp} has projective dimension at least [n(n+2)/4] - 3,([ ] denotes ceiling).
△ Less
Submitted 7 April, 2006;
originally announced April 2006.
-
Spectral measurement of the Hall angle response in normal state cuprate superconductors
Authors:
M. Grayson,
L. B. Rigal,
D. C. Schmadel,
H. D. Drew,
P. -J. Kung
Abstract:
We measure the temperature and frequency dependence of the complex Hall angle for normal state YBa$_2$Cu$_3$O$_7$ films from dc to far-infrared frequencies (20-250 cm$^{-1}$) using a new modulated polarization technique. We determine that the functional dependence of the Hall angle on scattering does not fit the expected Lorentzian response. We find spectral evidence supporting models of the Hal…
▽ More
We measure the temperature and frequency dependence of the complex Hall angle for normal state YBa$_2$Cu$_3$O$_7$ films from dc to far-infrared frequencies (20-250 cm$^{-1}$) using a new modulated polarization technique. We determine that the functional dependence of the Hall angle on scattering does not fit the expected Lorentzian response. We find spectral evidence supporting models of the Hall effect where the scattering $Γ_H$ is linear in T, suggesting that a single relaxation rate, linear in temperature, governs transport in the cuprates.
△ Less
Submitted 29 May, 2002; v1 submitted 31 August, 2001;
originally announced August 2001.
-
Infrared Hall effect in high Tc superconductors: Evidence for non-Fermi liquid Hall scattering
Authors:
J. Cerne,
M. Grayson,
D. C. Schmadel,
G. S. Jenkins,
H. D. Drew,
R. Hughes,
J. S. Preston,
P. -J. Kung
Abstract:
Infrared (20-120 cm-1 and 900-1100 cm-1) Faraday rotation and circular dichroism are measured in high Tc superconductors using sensitive polarization modulation techniques. Optimally doped YBCO thin films are studied at temperatures down to 15 K and magnetic fields up to 8 T. At 1000 cm-1 the Hall conductivity varies strongly with temperature in contrast to the longitudinal conductivity which is…
▽ More
Infrared (20-120 cm-1 and 900-1100 cm-1) Faraday rotation and circular dichroism are measured in high Tc superconductors using sensitive polarization modulation techniques. Optimally doped YBCO thin films are studied at temperatures down to 15 K and magnetic fields up to 8 T. At 1000 cm-1 the Hall conductivity varies strongly with temperature in contrast to the longitudinal conductivity which is nearly independent of temperature. The Hall scattering rate has a T^2 temperature dependence but, unlike a Fermi liquid, depends only weakly on frequency. The experiment puts severe constraints on theories of transport in the normal state of high Tc superconductors.
△ Less
Submitted 2 August, 1999;
originally announced August 1999.