Skip to main content

Showing 1–10 of 10 results for author: Noever, S E M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.12383  [pdf

    cs.AI cs.CL cs.CV

    Visual AI and Linguistic Intelligence Through Steerability and Composability

    Authors: David Noever, Samantha Elizabeth Miller Noever

    Abstract: This study explores the capabilities of multimodal large language models (LLMs) in handling challenging multistep tasks that integrate language and vision, focusing on model steerability, composability, and the application of long-term memory and context understanding. The problem addressed is the LLM's ability (Nov 2023 GPT-4 Vision Preview) to manage tasks that require synthesizing visual and te… ▽ More

    Submitted 18 November, 2023; originally announced December 2023.

  2. arXiv:2309.16705  [pdf

    cs.CV cs.CL cs.LG

    Multimodal Analysis Of Google Bard And GPT-Vision: Experiments In Visual Reasoning

    Authors: David Noever, Samantha Elizabeth Miller Noever

    Abstract: Addressing the gap in understanding visual comprehension in Large Language Models (LLMs), we designed a challenge-response study, subjecting Google Bard and GPT-Vision to 64 visual tasks, spanning categories like "Visual Situational Reasoning" and "Next Scene Prediction." Previous models, such as GPT4, leaned heavily on optical character recognition tools like Tesseract, whereas Bard and GPT-Visio… ▽ More

    Submitted 14 October, 2023; v1 submitted 16 August, 2023; originally announced September 2023.

  3. arXiv:2304.02016  [pdf

    cs.CL cs.CV cs.LG

    The Multimodal And Modular Ai Chef: Complex Recipe Generation From Imagery

    Authors: David Noever, Samantha Elizabeth Miller Noever

    Abstract: The AI community has embraced multi-sensory or multi-modal approaches to advance this generation of AI models to resemble expected intelligent understanding. Combining language and imagery represents a familiar method for specific tasks like image captioning or generation from descriptions. This paper compares these monolithic approaches to a lightweight and specialized method based on employing i… ▽ More

    Submitted 19 March, 2023; originally announced April 2023.

  4. arXiv:2207.08766  [pdf

    cs.LG

    Word Play for Playing Othello (Reverses)

    Authors: Samantha E. Miller Noever, David Noever

    Abstract: Language models like OpenAI's Generative Pre-Trained Transformers (GPT-2/3) capture the long-term correlations needed to generate text in a variety of domains (such as language translators) and recently in gameplay (chess, Go, and checkers). The present research applies both the larger (GPT-3) and smaller (GPT-2) language models to explore the complex strategies for the game of Othello (or Reverse… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  5. arXiv:2104.04359  [pdf

    cs.CV cs.LG

    Rock Hunting With Martian Machine Vision

    Authors: David Noever, Samantha E. Miller Noever

    Abstract: The Mars Perseverance rover applies computer vision for navigation and hazard avoidance. The challenge to do onboard object recognition highlights the need for low-power, customized training, often including low-contrast backgrounds. We investigate deep learning methods for the classification and detection of Martian rocks. We report greater than 97% accuracy for binary classifications (rock vs. r… ▽ More

    Submitted 9 April, 2021; originally announced April 2021.

  6. arXiv:2103.10480  [pdf

    cs.LG cs.CL cs.CV

    Reading Isn't Believing: Adversarial Attacks On Multi-Modal Neurons

    Authors: David A. Noever, Samantha E. Miller Noever

    Abstract: With Open AI's publishing of their CLIP model (Contrastive Language-Image Pre-training), multi-modal neural networks now provide accessible models that combine reading with visual recognition. Their network offers novel ways to probe its dual abilities to read text while classifying visual objects. This paper demonstrates several new categories of adversarial attacks, spanning basic typographical,… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  7. arXiv:2103.07765  [pdf

    cs.CR cs.LG

    Image Classifiers for Network Intrusions

    Authors: David A. Noever, Samantha E. Miller Noever

    Abstract: This research recasts the network attack dataset from UNSW-NB15 as an intrusion detection problem in image space. Using one-hot-encodings, the resulting grayscale thumbnails provide a quarter-million examples for deep learning algorithms. Applying the MobileNetV2's convolutional neural network architecture, the work demonstrates a 97% accuracy in distinguishing normal and attack traffic. Further c… ▽ More

    Submitted 13 March, 2021; originally announced March 2021.

  8. arXiv:2103.00602  [pdf

    cs.CR cs.LG

    Virus-MNIST: A Benchmark Malware Dataset

    Authors: David Noever, Samantha E. Miller Noever

    Abstract: The short note presents an image classification dataset consisting of 10 executable code varieties and approximately 50,000 virus examples. The malicious classes include 9 families of computer viruses and one benign set. The image formatting for the first 1024 bytes of the Portable Executable (PE) mirrors the familiar MNIST handwriting dataset, such that most of the previously explored algorithmic… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

  9. arXiv:2102.04266  [pdf

    cs.CV cs.LG

    Overhead MNIST: A Benchmark Satellite Dataset

    Authors: David Noever, Samantha E. Miller Noever

    Abstract: The research presents an overhead view of 10 important objects and follows the general formatting requirements of the most popular machine learning task: digit recognition with MNIST. This dataset offers a public benchmark extracted from over a million human-labelled and curated examples. The work outlines the key multi-class object identification task while matching with prior work in handwriting… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

  10. arXiv:2004.03366  [pdf

    cs.CV cs.LG

    Knife and Threat Detectors

    Authors: David A. Noever, Sam E. Miller Noever

    Abstract: Despite rapid advances in image-based machine learning, the threat identification of a knife wielding attacker has not garnered substantial academic attention. This relative research gap appears less understandable given the high knife assault rate (>100,000 annually) and the increasing availability of public video surveillance to analyze and forensically document. We present three complementary m… ▽ More

    Submitted 8 April, 2020; v1 submitted 4 April, 2020; originally announced April 2020.