Search | arXiv e-print repository

Small Celestial Body Exploration with CubeSat Swarms

Authors: Emmanuel Blazquez, Dario Izzo, Francesco Biscani, Roger Walker, Franco Perez-Lissi

Abstract: This work presents a large-scale simulation study investigating the deployment and operation of distributed swarms of CubeSats for interplanetary missions to small celestial bodies. Utilizing Taylor numerical integration and advanced collision detection techniques, we explore the potential of large CubeSat swarms in capturing gravity signals and reconstructing the internal mass distribution of a s… ▽ More This work presents a large-scale simulation study investigating the deployment and operation of distributed swarms of CubeSats for interplanetary missions to small celestial bodies. Utilizing Taylor numerical integration and advanced collision detection techniques, we explore the potential of large CubeSat swarms in capturing gravity signals and reconstructing the internal mass distribution of a small celestial body while minimizing risks and Delta V budget. Our results offer insight into the applicability of this approach for future deep space exploration missions. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2210.13635 [pdf, other]

Toward an Intelligent Tutoring System for Argument Mining in Legal Texts

Authors: Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef

Abstract: We propose an adaptive environment (CABINET) to support caselaw analysis (identifying key argument elements) based on a novel cognitive computing framework that carefully matches various machine learning (ML) capabilities to the proficiency of a user. CABINET supports law students in their learning as well as professionals in their work. The results of our experiments focused on the feasibility of… ▽ More We propose an adaptive environment (CABINET) to support caselaw analysis (identifying key argument elements) based on a novel cognitive computing framework that carefully matches various machine learning (ML) capabilities to the proficiency of a user. CABINET supports law students in their learning as well as professionals in their work. The results of our experiments focused on the feasibility of the proposed framework are promising. We show that the system is capable of identifying a potential error in the analysis with very low false positives rate (2.0-3.5%), as well as of predicting the key argument element type (e.g., an issue or a holding) with a reasonably high F1-score (0.74). △ Less

Submitted 24 October, 2022; originally announced October 2022.

Comments: Accepted for presentation at the 35th International Conference on Legal Knowledge and Information Systems (JURIX 2022) and publication in the Frontiers of Artificial Intelligence and Applications series of IOS Press

arXiv:2201.06653 [pdf, other]

Data-Centric Machine Learning in the Legal Domain

Authors: Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef

Abstract: Machine learning research typically starts with a fixed data set created early in the process. The focus of the experiments is finding a model and training procedure that result in the best possible performance in terms of some selected evaluation metric. This paper explores how changes in a data set influence the measured performance of a model. Using three publicly available data sets from the l… ▽ More Machine learning research typically starts with a fixed data set created early in the process. The focus of the experiments is finding a model and training procedure that result in the best possible performance in terms of some selected evaluation metric. This paper explores how changes in a data set influence the measured performance of a model. Using three publicly available data sets from the legal domain, we investigate how changes to their size, the train/test splits, and the human labelling accuracy impact the performance of a trained deep learning classifier. We assess the overall performance (weighted average) as well as the per-class performance. The observed effects are surprisingly pronounced, especially when the per-class performance is considered. We investigate how "semantic homogeneity" of a class, i.e., the proximity of sentences in a semantic embedding space, influences the difficulty of its classification. The presented results have far reaching implications for efforts related to data collection and curation in the field of AI & Law. The results also indicate that enhancements to a data set could be considered, alongside the advancement of the ML models, as an additional path for increasing classification performance on various tasks in AI & Law. Finally, we discuss the need for an established methodology to assess the potential effects of data set properties. △ Less

Submitted 17 January, 2022; originally announced January 2022.

arXiv:2112.11494 [pdf, other]

doi 10.3233/FAIA200860

Sentence Embeddings and High-speed Similarity Search for Fast Computer Assisted Annotation of Legal Documents

Authors: Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef

Abstract: Human-performed annotation of sentences in legal documents is an important prerequisite to many machine learning based systems supporting legal tasks. Typically, the annotation is done sequentially, sentence by sentence, which is often time consuming and, hence, expensive. In this paper, we introduce a proof-of-concept system for annotating sentences "laterally." The approach is based on the obser… ▽ More Human-performed annotation of sentences in legal documents is an important prerequisite to many machine learning based systems supporting legal tasks. Typically, the annotation is done sequentially, sentence by sentence, which is often time consuming and, hence, expensive. In this paper, we introduce a proof-of-concept system for annotating sentences "laterally." The approach is based on the observation that sentences that are similar in meaning often have the same label in terms of a particular type system. We use this observation in allowing annotators to quickly view and annotate sentences that are semantically similar to a given sentence, across an entire corpus of documents. Here, we present the interface of the system and empirically evaluate the approach. The experiments show that lateral annotation has the potential to make the annotation process quicker and more consistent. △ Less

Submitted 21 December, 2021; originally announced December 2021.

Journal ref: Frontiers in Artificial Intelligence and Applications, Volume 334: Legal Knowledge and Information Systems, 2020, pp. 164-173

arXiv:2112.05807 [pdf, other]

doi 10.3233/FAIA190313

Computer-Assisted Creation of Boolean Search Rules for Text Classification in the Legal Domain

Authors: Hannes Westermann, Jaromir Savelka, Vern R. Walker, Kevin D. Ashley, Karim Benyekhlef

Abstract: In this paper, we present a method of building strong, explainable classifiers in the form of Boolean search rules. We developed an interactive environment called CASE (Computer Assisted Semantic Exploration) which exploits word co-occurrence to guide human annotators in selection of relevant search terms. The system seamlessly facilitates iterative evaluation and improvement of the classification… ▽ More In this paper, we present a method of building strong, explainable classifiers in the form of Boolean search rules. We developed an interactive environment called CASE (Computer Assisted Semantic Exploration) which exploits word co-occurrence to guide human annotators in selection of relevant search terms. The system seamlessly facilitates iterative evaluation and improvement of the classification rules. The process enables the human annotators to leverage the benefits of statistical information while incorporating their expert intuition into the creation of such rules. We evaluate classifiers created with our CASE system on 4 datasets, and compare the results to machine learning methods, including SKOPE rules, Random forest, Support Vector Machine, and fastText classifiers. The results drive the discussion on trade-offs between superior compactness, simplicity, and intuitiveness of the Boolean search rules versus the better performance of state-of-the-art machine learning models for text classification. △ Less

Submitted 10 December, 2021; originally announced December 2021.

Journal ref: Frontiers in Artificial Intelligence and Applications, Volume 322: Legal Knowledge and Information Systems, 2019, pp. 123 - 132

arXiv:2005.04588 [pdf]

Transformer Based Language Models for Similar Text Retrieval and Ranking

Authors: Javed Qadrud-Din, Ashraf Bah Rabiou, Ryan Walker, Ravi Soni, Martin Gajek, Gabriel Pack, Akhil Rangaraj

Abstract: Most approaches for similar text retrieval and ranking with long natural language queries rely at some level on queries and responses having words in common with each other. Recent applications of transformer-based neural language models to text retrieval and ranking problems have been very promising, but still involve a two-step process in which result candidates are first obtained through bag-of… ▽ More Most approaches for similar text retrieval and ranking with long natural language queries rely at some level on queries and responses having words in common with each other. Recent applications of transformer-based neural language models to text retrieval and ranking problems have been very promising, but still involve a two-step process in which result candidates are first obtained through bag-of-words-based approaches, and then reranked by a neural transformer. In this paper, we introduce novel approaches for effectively applying neural transformer models to similar text retrieval and ranking without an initial bag-of-words-based step. By eliminating the bag-of-words-based step, our approach is able to accurately retrieve and rank results even when they have no non-stopwords in common with the query. We accomplish this by using bidirectional encoder representations from transformers (BERT) to create vectorized representations of sentence-length texts, along with a vector nearest neighbor search index. We demonstrate both supervised and unsupervised means of using BERT to accomplish this task. △ Less

Submitted 21 May, 2020; v1 submitted 10 May, 2020; originally announced May 2020.

Comments: 5 pages, 2 figures

arXiv:1912.06981 [pdf, ps, other]

Local Parametric Surface Approximation With Automatic Order Selection From Position Data

Authors: Michael R. Walker II

Abstract: Acquiring an anatomical map from position data is important for medical applications where catheters interact with soft tissues. To improve autonomous navigation in these settings, we require information beyond nonparametric maps typically available. We present an algorithm for local surface approximation from position data with automatic surface order selection. The traditional surface fitting ob… ▽ More Acquiring an anatomical map from position data is important for medical applications where catheters interact with soft tissues. To improve autonomous navigation in these settings, we require information beyond nonparametric maps typically available. We present an algorithm for local surface approximation from position data with automatic surface order selection. The traditional surface fitting objective function is derived from a Bayesian perspective. Posterior probabilities from the occupancy map are incorporated as weights on points selected for surface fitting. Our novel iterative algorithm incorporates surface order selection using the Bayesian information criterion. Simulations demonstrate the ability to automatically select surface order consistent with the latent surface in the presence of noise. Results on human procedure data are also presented. △ Less

Submitted 10 July, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

Comments: Accepted for publication in the 2020 International Symposium on Medical Robotics (ISMR)

arXiv:1708.04713 [pdf, ps, other]

Counting Roots of Polynomials over $\mathbb{Z}/p^2\mathbb{Z}$

Authors: Trajan Hammonds, Jeremy Johnson, Angela Patini, Robert M. Walker

Abstract: Until recently, the only known method of finding the roots of polynomials over prime power rings, other than fields, was brute force. One reason for this is the lack of a division algorithm, obstructing the use of greatest common divisors. Fix a prime $p \in \mathbb{Z}$ and $f \in ( \mathbb{Z}/p^n \mathbb{Z} ) [x]$ any nonzero polynomial of degree $d$ whose coefficients are not all divisible by… ▽ More Until recently, the only known method of finding the roots of polynomials over prime power rings, other than fields, was brute force. One reason for this is the lack of a division algorithm, obstructing the use of greatest common divisors. Fix a prime $p \in \mathbb{Z}$ and $f \in ( \mathbb{Z}/p^n \mathbb{Z} ) [x]$ any nonzero polynomial of degree $d$ whose coefficients are not all divisible by $p$. For the case $n=2$, we prove a new efficient algorithm to count the roots of $f$ in $\mathbb{Z}/p^2\mathbb{Z}$ within time polynomial in $(d+\operatorname{size}(f)+\log{p})$, and record a concise formula for the number of roots, formulated by Cheng, Gao, Rojas, and Wan. △ Less

Submitted 12 December, 2017; v1 submitted 15 August, 2017; originally announced August 2017.

Comments: 6 pages, comments welcome! Rewritten to address referee feedback. Bibliography updated. There is a new Corollary 3.3 giving a formula for the number of degenerate roots modulo p that fail to lift to roots modulo p^2

MSC Class: 11Y05; 11Y16; 13F20 (Primary). 11M38; 11S05; 11T06 (Secondary)

Journal ref: Houston Journal of Mathematics (2018), Vol. 44, no. 4, pp. 1111-1119

arXiv:1605.03009 [pdf]

Consciousness is Pattern Recognition

Authors: Ray Van De Walker

Abstract: This is a proof of the strong AI hypothesis, i.e. that machines can be conscious. It is a phenomenological proof that pattern-recognition and subjective consciousness are the same activity in different terms. Therefore, it proves that essential subjective processes of consciousness are computable, and identifies significant traits and requirements of a conscious system. Since Husserl, many philoso… ▽ More This is a proof of the strong AI hypothesis, i.e. that machines can be conscious. It is a phenomenological proof that pattern-recognition and subjective consciousness are the same activity in different terms. Therefore, it proves that essential subjective processes of consciousness are computable, and identifies significant traits and requirements of a conscious system. Since Husserl, many philosophers have accepted that consciousness consists of memories of logical connections between an ego and external objects. These connections are called "intentions." Pattern recognition systems are achievable technical artifacts. The proof links this respected introspective philosophical theory of consciousness with technical art. The proof therefore endorses the strong AI hypothesis and may therefore also enable a theoretically-grounded form of artificial intelligence called a "synthetic intentionality," able to synthesize, generalize, select and repeat intentions. If the pattern recognition is reflexive, able to operate on the set of intentions, and flexible, with several methods of synthesizing intentions, an SI may be a particularly strong form of AI. Similarities and possible applications to several AI paradigms are discussed. The article then addresses some problems: The proof's limitations, reflexive cognition, Searles' Chinese room, and how an SI could "understand" "meanings" and "be creative." △ Less

Submitted 28 June, 2016; v1 submitted 4 May, 2016; originally announced May 2016.

Comments: 8 pages; Now describes the utility of the proof. Lemma A3 is improved. The root lemma is clarified. Included and excused some basic objections. Reordered the speculations, objections and excuses to be more coherent. Added paragraphs and references to aid some AI paradigms. Added my orcid and revised the abstract

ACM Class: I.2.0

arXiv:cs/0307007 [pdf, ps, other]

Management of Grid Jobs and Information within SAMGrid

Authors: A. Baranovski, G. Garzoglio, A. Kreymer, L. Lueking, S. Stonjek, I. Terekhov, F. Wuerthwein, A. Roy, P. Mhashikar, V. Murthi, T. Tannenbaum, R. Walker, F. Ratnikov, T. Rockwell

Abstract: We describe some of the key aspects of the SAMGrid system, used by the D0 and CDF experiments at Fermilab. Having sustained success of the data handling part of SAMGrid, we have developed new services for job and information services. Our job management is rooted in \CondorG and uses enhancements that are general applicability for HEP grids. Our information system is based on a uniform framework… ▽ More We describe some of the key aspects of the SAMGrid system, used by the D0 and CDF experiments at Fermilab. Having sustained success of the data handling part of SAMGrid, we have developed new services for job and information services. Our job management is rooted in \CondorG and uses enhancements that are general applicability for HEP grids. Our information system is based on a uniform framework for configuration management based on XML data representation and processing. △ Less

Submitted 8 July, 2003; v1 submitted 3 July, 2003; originally announced July 2003.

Comments: 7 pages including figures, presented at CHEP 2003

ACM Class: c.1.4

Journal ref: ECONF C0303241:TUAT002,2003

Showing 1–10 of 10 results for author: Walker, R