
Showing 101–129 of 129 results for author: Sheth, A

  1. A Quality Type-aware Annotated Corpus and Lexicon for Harassment Research

    Authors: Mohammadreza Rezvan, Saeedeh Shekarpour, Lakshika Balasuriya, Krishnaprasad Thirunarayan, Valerie Shalin, Amit Sheth

    Abstract: Having a quality annotated corpus is essential, especially for applied research. Despite the recent focus of the Web science community on researching cyberbullying, the community still does not have standard benchmarks. In this paper, we publish first, a quality annotated corpus and second, an offensive words lexicon capturing different types of harassment as (i) sexual harassment, (ii) raci…

    Submitted 23 May, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

  2. Machine learning for Internet of Things data analysis: A survey

    Authors: Mohammad Saeid Mahdavinejad, Mohammadreza Rezvan, Mohammadamin Barekatain, Peyman Adibi, Payam Barnaghi, Amit P. Sheth

    Abstract: Rapid developments in hardware, software, and communication technologies have allowed the emergence of Internet-connected sensory devices that provide observation and data measurement from the physical world. By 2020, it is estimated that the total number of Internet-connected devices being used will be between 25 and 50 billion. As the numbers grow and technologies become more mature, the volume…

    Submitted 17 February, 2018; originally announced February 2018.

    Comments: Digital Communications and Networks (2017)

  3. arXiv:1801.00356  [pdf]

    cs.CY cs.AI

    How will the Internet of Things enable Augmented Personalized Health?

    Authors: Amit Sheth, Utkarshani Jaimini, Hong Yung Yip

    Abstract: Internet-of-Things (IoT) is profoundly redefining the way we create, consume, and share information. Health aficionados and citizens are increasingly using IoT technologies to track their sleep, food intake, activity, vital body signals, and other physiological observations. This is complemented by IoT systems that continuously collect health-related data from the environment and inside the living…

    Submitted 31 December, 2017; originally announced January 2018.

  4. arXiv:1710.05429  [pdf, other]

    cs.CL

    Semi-Supervised Approach to Monitoring Clinical Depressive Symptoms in Social Media

    Authors: Amir Hossein Yazdavar, Hussein S. Al-Olimat, Monireh Ebrahimi, Goonmeet Bajaj, Tanvi Banerjee, Krishnaprasad Thirunarayan, Jyotishman Pathak, Amit Sheth

    Abstract: With the rise of social media, millions of people are routinely expressing their moods, feelings, and daily struggles with mental health issues on social media platforms like Twitter. Unlike traditional observational cohort studies conducted through questionnaires and self-reported surveys, we explore the reliable detection of clinical depression from tweets obtained unobtrusively. Based on the an…

    Submitted 15 October, 2017; originally announced October 2017.

    Comments: 8 pages, Advances in Social Networks Analysis and Mining (ASONAM), 2017 IEEE/ACM International Conference

  5. arXiv:1710.02514  [pdf]

    cs.CL

    On the Challenges of Sentiment Analysis for Dynamic Events

    Authors: Monireh Ebrahimi, Amir Hossein Yazdavar, Amit Sheth

    Abstract: With the proliferation of social media over the last decade, determining people's attitude with respect to a specific topic, document, interaction or events has fueled research interest in natural language processing and introduced a new channel called sentiment and emotion analysis. For instance, businesses routinely look to develop systems to automatically understand their customer conversations…

    Submitted 6 October, 2017; originally announced October 2017.

    Comments: 9 pages, 2 figures, IEEE Intelligent Systems 2017

  6. arXiv:1708.03105  [pdf, other]

    cs.CL

    Location Name Extraction from Targeted Text Streams using Gazetteer-based Statistical Language Models

    Authors: Hussein S. Al-Olimat, Krishnaprasad Thirunarayan, Valerie Shalin, Amit Sheth

    Abstract: Extracting location names from informal and unstructured social media data requires the identification of referent boundaries and partitioning compound names. Variability, particularly systematic variability in location names (Carroll, 1983), challenges the identification task. Some of this variability can be anticipated as operations within a statistical language model, in this case drawn from ga…

    Submitted 26 April, 2020; v1 submitted 10 August, 2017; originally announced August 2017.

    Comments: https://www.aclweb.org/anthology/C18-1169.pdf

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: In The 27th International Conference on Computational Linguistics (COLING 2018)

  7. Implicit Entity Linking in Tweets

    Authors: Sujan Perera, Pablo N. Mendes, Adarsh Alex, Amit Sheth, Krishnaprasad Thirunarayan

    Abstract: Over the years, Twitter has become one of the largest communication platforms providing key data to various applications such as brand monitoring, trend detection, among others. Entity linking is one of the major tasks in natural language understanding from tweets and it associates entity mentions in text to corresponding entries in knowledge bases in order to provide unambiguous interpretation an…

    Submitted 26 July, 2017; originally announced July 2017.

    Comments: This paper was accepted at the Extended Semantic Web Conference 2016 as a full research track paper

  8. Knowledge will Propel Machine Understanding of Content: Extrapolating from Current Examples

    Authors: Amit Sheth, Sujan Perera, Sanjaya Wijeratne, Krishnaprasad Thirunarayan

    Abstract: Machine Learning has been a big success story during the AI resurgence. One particular stand out success relates to learning from a massive amount of data. In spite of early assertions of the unreasonable effectiveness of data, there is increasing recognition for utilizing knowledge whenever it is available or can be created purposefully. In this paper, we discuss the indispensable role of knowled…

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: Pre-print of the paper accepted at 2017 IEEE/WIC/ACM International Conference on Web Intelligence (WI). arXiv admin note: substantial text overlap with arXiv:1610.07708

  9. A Semantics-Based Measure of Emoji Similarity

    Authors: Sanjaya Wijeratne, Lakshika Balasuriya, Amit Sheth, Derek Doran

    Abstract: Emoji have grown to become one of the most important forms of communication on the web. With its widespread use, measuring the similarity of emoji has become an important problem for contemporary text processing since it lies at the heart of sentiment analysis, search, and interface design tasks. This paper presents a comprehensive analysis of the semantic similarity of emoji through embedding mod…

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: This paper is accepted at Web Intelligence 2017 as a full paper, In 2017 IEEE/WIC/ACM International Conference on Web Intelligence (WI). Leipzig, Germany: ACM, 2017

  10. arXiv:1707.04652  [pdf, other]

    cs.CL cs.SI

    EmojiNet: An Open Service and API for Emoji Sense Discovery

    Authors: Sanjaya Wijeratne, Lakshika Balasuriya, Amit Sheth, Derek Doran

    Abstract: This paper presents the release of EmojiNet, the largest machine-readable emoji sense inventory that links Unicode emoji representations to their English meanings extracted from the Web. EmojiNet is a dataset consisting of: (i) 12,904 sense labels over 2,389 emoji, which were extracted from the web and linked to machine-readable sense definitions seen in BabelNet, (ii) context words associated wit…

    Submitted 14 July, 2017; originally announced July 2017.

    Comments: This paper was published at ICWSM 2017 as a full paper, Proc. of the 11th International AAAI Conference on Web and Social Media (ICWSM 2017). Montreal, Canada. 2017

  11. arXiv:1701.07490  [pdf]

    cs.SI q-bio.OT

    What Are People Tweeting about Zika? An Exploratory Study Concerning Symptoms, Treatment, Transmission, and Prevention

    Authors: Michele Miller, Dr. Tanvi Banerjee, RoopTeja Muppalla, Dr. William Romine, Dr. Amit Sheth

    Abstract: The purpose of this study was to do a dataset distribution analysis, a classification performance analysis, and a topical analysis concerning what people are tweeting about four disease characteristics: symptoms, transmission, prevention, and treatment. A combination of natural language processing and machine learning techniques was used to determine what people are tweeting about Zika. Specifica…

    Submitted 17 January, 2017; originally announced January 2017.

  12. arXiv:1701.05724  [pdf, other]

    cs.AI cs.DB

    Logical Inferences with Contexts of RDF Triples

    Authors: Vinh Nguyen, Amit Sheth

    Abstract: Logical inference, an integral feature of the Semantic Web, is the process of deriving new triples by applying entailment rules on knowledge bases. The entailment rules are determined by the model-theoretic semantics. Incorporating context of an RDF triple (e.g., provenance, time, and location) into the inferencing process requires the formal semantics to be capable of describing the context of RD…

    Submitted 20 January, 2017; originally announced January 2017.

  13. arXiv:1701.05625  [pdf, other]

    cs.CL

    CEVO: Comprehensive EVent Ontology Enhancing Cognitive Annotation

    Authors: Saeedeh Shekarpour, Faisal Alshargi, Valerie Shalin, Krishnaprasad Thirunarayan, Amit P. Sheth

    Abstract: While the general analysis of named entities has received substantial research attention on unstructured as well as structured data, the analysis of relations among named entities has received limited focus. In fact, a review of the literature revealed a deficiency in research on the abstract conceptualization required to organize relations. We believe that such an abstract conceptualization can b…

    Submitted 3 October, 2018; v1 submitted 19 January, 2017; originally announced January 2017.

  14. arXiv:1610.09516  [pdf, other]

    cs.SI cs.CL cs.CY cs.IR

    Finding Street Gang Members on Twitter

    Authors: Lakshika Balasuriya, Sanjaya Wijeratne, Derek Doran, Amit Sheth

    Abstract: Most street gang members use Twitter to intimidate others, to present outrageous images and statements to the world, and to share recent illegal activities. Their tweets may thus be useful to law enforcement agencies to discover clues about recent crimes or to anticipate ones that may occur. Finding these posts, however, requires a method to discover gang member Twitter profiles. This is a challen…

    Submitted 29 October, 2016; originally announced October 2016.

    Comments: 8 pages, 9 figures, 2 tables, Published as a full paper at 2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2016)

    Journal ref: The 2016 IEEE/ACM Int. Conf. on Advances in Social Networks Analysis and Mining. vol. 8, pp. 685-692. San Francisco, CA, USA (2016)

  15. arXiv:1610.08597  [pdf, other]

    cs.SI cs.CL cs.CY cs.IR

    Word Embeddings to Enhance Twitter Gang Member Profile Identification

    Authors: Sanjaya Wijeratne, Lakshika Balasuriya, Derek Doran, Amit Sheth

    Abstract: Gang affiliates have joined the masses who use social media to share thoughts and actions publicly. Interestingly, they use this public medium to express recent illegal actions, to intimidate others, and to share outrageous images and statements. Agencies able to unearth these profiles may thus be able to anticipate, stop, or hasten the investigation of gang-related crimes. This paper investigates…

    Submitted 26 October, 2016; originally announced October 2016.

    Comments: 7 pages, 1 figure, 2 tables, Published at IJCAI Workshop on Semantic Machine Learning (SML 2016)

    Journal ref: IJCAI Workshop on Semantic Machine Learning (SML 2016). pp. 18-24. CEUR-WS, New York City, NY (07 2016)

  16. arXiv:1610.07710  [pdf, other]

    cs.CL

    EmojiNet: Building a Machine Readable Sense Inventory for Emoji

    Authors: Sanjaya Wijeratne, Lakshika Balasuriya, Amit Sheth, Derek Doran

    Abstract: Emoji are a contemporary and extremely popular way to enhance electronic communication. Without rigid semantics attached to them, emoji symbols take on different meanings based on the context of a message. Thus, like the word sense disambiguation task in natural language processing, machines also need to disambiguate the meaning or sense of an emoji. In a first step toward achieving this goal, thi…

    Submitted 24 October, 2016; originally announced October 2016.

    Comments: 15 pages, 4 figures, 3 tables, Accepted to publish at the 8th International Conference on Social Informatics (SocInfo 2016) as a full research track paper

    ACM Class: I.2.7

  17. arXiv:1610.07708   

    cs.AI cs.CL

    Knowledge will Propel Machine Understanding of Content: Extrapolating from Current Examples

    Authors: Amit Sheth, Sujan Perera, Sanjaya Wijeratne

    Abstract: Machine Learning has been a big success story during the AI resurgence. One particular stand out success relates to unsupervised learning from a massive amount of data, albeit much of it relates to one modality/type of data at a time. In spite of early assertions of the unreasonable effectiveness of data, there is increasing recognition of utilizing knowledge whenever it is available or can be cre…

    Submitted 22 January, 2019; v1 submitted 24 October, 2016; originally announced October 2016.

    Comments: There is a new version of this paper with new authors uploaded as arXiv:1707.05308, so this is an invalid entry

    ACM Class: I.2

  18. arXiv:1609.09014  [pdf, other]

    cs.OH

    SWoTSuite: A Development Framework for Prototyping Cross-domain Semantic Web of Things Applications

    Authors: Pankesh Patel, Amelie Gyrard, Dhavalkumar Thakker, Amit Sheth, Martin Serrano

    Abstract: Semantic Web of Things (SWoT) applications focus on providing a wide-scale interoperability that allows the sharing of IoT devices across domains and the reusing of available knowledge on the web. However, the application development is difficult because developers have to do various tasks such as designing an application, annotating IoT data, interpreting data, and combining application domains.…

    Submitted 28 September, 2016; originally announced September 2016.

    Comments: 8 pages

  19. arXiv:1606.07988  [pdf, other]

    cs.SE

    Building the Web of Knowledge with Smart IoT Applications (Extended Version)

    Authors: Amelie Gyrard, Pankesh Patel, Amit Sheth, Martin Serrano

    Abstract: The Internet of Things (IoT) is experiencing fast adoption in society, from industrial to home applications. The number of deployed sensors and connected devices to the Internet is changing our perspective and the way we understand the world. The development and generation of IoT applications is just starting and they will modify our physical and virtual lives, from how we control remotely app…

    Submitted 25 June, 2016; originally announced June 2016.

    Comments: 7 pages, 3 figures

  20. arXiv:1606.00480  [pdf, other]

    cs.DB

    A Formal Graph Model for RDF and Its Implementation

    Authors: Vinh Nguyen, Jyoti Leeka, Olivier Bodenreider, Amit Sheth

    Abstract: Formalizing an RDF abstract graph model to be compatible with the RDF formal semantics has remained one of the foundational problems in the Semantic Web. In this paper, we propose a new formal graph model for RDF datasets. This model allows us to express the current model-theoretic semantics in the form of a graph. We also propose the concepts of resource path and triple path as well as an algorit…

    Submitted 1 June, 2016; originally announced June 2016.

  21. arXiv:1510.05963  [pdf]

    cs.AI

    Semantic, Cognitive, and Perceptual Computing: Advances toward Computing for Human Experience

    Authors: Amit Sheth, Pramod Anantharam, Cory Henson

    Abstract: The World Wide Web continues to evolve and serve as the infrastructure for carrying massive amounts of multimodal and multisensory observations. These observations capture various situations pertinent to people's needs and interests along with all their idiosyncrasies. To support human-centered computing that empowers people in making better and timely decisions, we look towards computation that is…

    Submitted 20 October, 2015; originally announced October 2015.

    Comments: 13 pages, 4 Figures, IEEE Computer

  22. arXiv:1509.04513  [pdf, ps, other]

    cs.AI cs.DB

    On Reasoning with RDF Statements about Statements using Singleton Property Triples

    Authors: Vinh Nguyen, Olivier Bodenreider, Krishnaprasad Thirunarayan, Gang Fu, Evan Bolton, Núria Queralt Rosinach, Laura I. Furlong, Michel Dumontier, Amit Sheth

    Abstract: The Singleton Property (SP) approach has been proposed for representing and querying metadata about RDF triples such as provenance, time, location, and evidence. In this approach, one singleton property is created to uniquely represent a relationship in a particular context, and in general, generates a large property hierarchy in the schema. It has become the subject of important questions from Se…

    Submitted 15 September, 2015; originally announced September 2015.

  23. arXiv:1509.02822  [pdf, other]

    cs.DB cs.PF

    Exposing Provenance Metadata Using Different RDF Models

    Authors: Gang Fu, Evan Bolton, Núria Queralt Rosinach, Laura I. Furlong, Vinh Nguyen, Amit Sheth, Olivier Bodenreider, Michel Dumontier

    Abstract: A standard model for exposing structured provenance metadata of scientific assertions on the Semantic Web would increase interoperability, discoverability, reliability, as well as reproducibility for scientific discourse and evidence-based knowledge discovery. Several Resource Description Framework (RDF) models have been proposed to track provenance. However, provenance metadata may not only be ve…

    Submitted 9 September, 2015; originally announced September 2015.

  24. arXiv:1503.02086  [pdf]

    cs.SI cs.CY

    Gender-Based Violence in 140 Characters or Fewer: A #BigData Case Study of Twitter

    Authors: Hemant Purohit, Tanvi Banerjee, Andrew Hampton, Valerie L. Shalin, Nayanesh Bhandutia, Amit P. Sheth

    Abstract: Public institutions are increasingly reliant on data from social media sites to measure public attitude and provide timely public engagement. Such reliance includes the exploration of public views on important social issues such as gender-based violence (GBV). In this study, we examine big (social) data consisting of nearly fourteen million tweets collected from Twitter over a period of ten months…

    Submitted 29 June, 2015; v1 submitted 6 March, 2015; originally announced March 2015.

    ACM Class: H.1.2; J.4

  25. arXiv:1503.00760  [pdf]

    cs.SI

    On Using Synthetic Social Media Stimuli in an Emergency Preparedness Functional Exercise

    Authors: Andrew Hampton, Shreyansh Bhatt, Alan Smith, Jeremy Brunn, Hemant Purohit, Valerie L. Shalin, John M. Flach, Amit P. Sheth

    Abstract: This paper details the creation and use of a massive (over 32,000 messages) artificially constructed 'Twitter' microblog stream for a regional emergency preparedness functional exercise. By combining microblog conversion, manual production, and a control set, we created a web based information stream providing valid, misleading, and irrelevant information to public information officers (PIOs) repr…

    Submitted 2 March, 2015; originally announced March 2015.

    Comments: 18 pages

  26. arXiv:1411.3761  [pdf, other]

    cs.IR

    A Hybrid Approach to Finding Relevant Social Media Content for Complex Domain Specific Information Needs

    Authors: Delroy Cameron, Amit Sheth, Nishita Jaykumar, Krishnaprasad Thirunarayan, Gaurish Anand, Gary A. Smith

    Abstract: While contemporary semantic search systems offer to improve classical keyword-based search, they are not always adequate for complex domain specific information needs. The domain of prescription drug abuse, for example, requires knowledge of both ontological concepts and 'intelligible constructs' not typically modeled in ontologies. These intelligible constructs convey essential information that i…

    Submitted 13 November, 2014; originally announced November 2014.

    Comments: Accepted for publication: Journal of Web Semantics, Elsevier

    ACM Class: H.3.3

  27. arXiv:1410.4977  [pdf]

    cs.NI cs.DC

    Semantic Gateway as a Service architecture for IoT Interoperability

    Authors: Pratikkumar Desai, Amit Sheth, Pramod Anantharam

    Abstract: The Internet of Things (IoT) is set to occupy a substantial component of the future Internet. The IoT connects sensors and devices that record physical observations to applications and services of the Internet. As a successor to technologies such as RFID and Wireless Sensor Networks (WSN), the IoT has stumbled into vertical silos of proprietary systems, providing little or no interoperability with sim…

    Submitted 18 October, 2014; originally announced October 2014.

    Comments: 16 pages

  28. arXiv:1212.0141  [pdf, other]

    cs.SI physics.soc-ph

    On the Role of Social Identity and Cohesion in Characterizing Online Social Communities

    Authors: Hemant Purohit, Yiye Ruan, David Fuhry, Srinivasan Parthasarathy, Amit Sheth

    Abstract: Two prevailing theories for explaining social group or community structure are cohesion and identity. The social cohesion approach posits that social groups arise out of an aggregation of individuals that have mutual interpersonal attraction as they share common characteristics. These characteristics can range from common interests to kinship ties and from social values to ethnic backgrounds. In c…

    Submitted 1 December, 2012; originally announced December 2012.

    ACM Class: H.5.3; J.4

  29. arXiv:1210.0595  [pdf]

    cs.IR cs.DB

    From Questions to Effective Answers: On the Utility of Knowledge-Driven Querying Systems for Life Sciences Data

    Authors: Amir H. Asiaee, Prashant Doshi, Todd Minning, Satya Sahoo, Priti Parikh, Amit Sheth, Rick L. Tarleton

    Abstract: We compare two distinct approaches for querying data in the context of the life sciences. The first approach utilizes conventional databases to store the data and intuitive form-based interfaces to facilitate easy querying of the data. These interfaces could be seen as implementing a set of "pre-canned" queries commonly used by the life science researchers that we study. The second approach is bas…

    Submitted 1 October, 2012; originally announced October 2012.