Showing 1–20 of 20 results for author: Zaniolo, C

  1. arXiv:2205.14857  [pdf, other]

    cs.DB

    Demonstration of LogicLib: An Expressive Multi-Language Interface over Scalable Datalog System

    Authors: Mingda Li, Jin Wang, Guorui Xiao, Youfu Li, Carlo Zaniolo

    Abstract: With the ever-increasing volume of data, there is an urgent need to provide expressive and efficient tools to support Big Data analytics. The declarative logical language Datalog has proven very effective at expressing concisely graph, machine learning, and knowledge discovery applications via recursive queries. In this demonstration, we develop Logic Library (LLib), a library of recursive algorit…

    Submitted 5 September, 2022; v1 submitted 30 May, 2022; originally announced May 2022.

    Comments: 4 pages

    MSC Class: 68M14 ACM Class: H.2.3

  2. arXiv:2103.04283  [pdf, ps, other]

    q-bio.MN cs.LG q-bio.BM q-bio.GN

    Bio-JOIE: Joint Representation Learning of Biological Knowledge Bases

    Authors: Junheng Hao, Chelsea Ju, Muhao Chen, Yizhou Sun, Carlo Zaniolo, Wei Wang

    Abstract: The widespread of Coronavirus has led to a worldwide pandemic with a high mortality rate. Currently, the knowledge accumulated from different studies about this virus is very limited. Leveraging a wide-range of biological knowledge, such as gene ontology and protein-protein interaction (PPI) networks from other closely related species presents a vital approach to infer the molecular impact of a ne…

    Submitted 7 March, 2021; originally announced March 2021.

    Comments: ACM BCB 2020, Best Student Paper

    Journal ref: In Procs of the 11th ACM BCB, pp. 1-10. 2020

  3. arXiv:2010.03158  [pdf, other]

    cs.CL cs.AI cs.LG

    Multilingual Knowledge Graph Completion via Ensemble Knowledge Transfer

    Authors: Xuelu Chen, Muhao Chen, Changjun Fan, Ankith Uppunda, Yizhou Sun, Carlo Zaniolo

    Abstract: Predicting missing facts in a knowledge graph (KG) is a crucial task in knowledge base construction and reasoning, and it has been the subject of much research in recent works using KG embeddings. While existing KG embedding approaches mainly learn and predict facts within a single KG, a more plausible solution would benefit from the knowledge in multiple language-specific KGs, considering that di…

    Submitted 8 October, 2020; v1 submitted 7 October, 2020; originally announced October 2020.

    Comments: Findings of EMNLP 2020

  4. arXiv:1910.08888  [pdf, ps, other]

    cs.DB

    Monotonic Properties of Completed Aggregates in Recursive Queries

    Authors: Carlo Zaniolo, Ariyam Das, Jiaqi Gu, Youfu Li, Mingda Li, Jin Wang

    Abstract: The use of aggregates in recursion enables efficient and scalable support for a wide range of BigData algorithms, including those used in graph applications, KDD applications, and ML applications, which have proven difficult to be expressed and supported efficiently in BigData systems supporting Datalog or SQL. The problem with these languages and systems is that, to avoid the semantic and computa…

    Submitted 19 October, 2019; originally announced October 2019.

  5. arXiv:1909.08249  [pdf, ps, other]

    cs.LO cs.DB cs.LG

    BigData Applications from Graph Analytics to Machine Learning by Aggregates in Recursion

    Authors: Ariyam Das, Youfu Li, Jin Wang, Mingda Li, Carlo Zaniolo

    Abstract: In the past, the semantic issues raised by the non-monotonic nature of aggregates often prevented their use in the recursive statements of logic programs and deductive databases. However, the recently introduced notion of Pre-mappability (PreM) has shown that, in key applications of interest, aggregates can be used in recursion to optimize the perfect-model semantics of aggregate-stratified progra…

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: In Proceedings ICLP 2019, arXiv:1909.07646. Paper presented at the 35th International Conference on Logic Programming (ICLP 2019), Las Cruces, New Mexico, USA, 20-25 September 2019, 7 pages (short paper - applications track)

    Journal ref: EPTCS 306, 2019, pp. 273-279

  6. arXiv:1907.10278  [pdf, other]

    cs.PL cs.DB cs.DC cs.LO

    A Case for Stale Synchronous Distributed Model for Declarative Recursive Computation

    Authors: Ariyam Das, Carlo Zaniolo

    Abstract: A large class of traditional graph and data mining algorithms can be concisely expressed in Datalog, and other Logic-based languages, once aggregates are allowed in recursion. In fact, for most BigData algorithms, the difficult semantic issues raised by the use of non-monotonic aggregates in recursion are solved by Pre-Mappability (PreM), a property that assures that for a program with aggregates…

    Submitted 24 July, 2019; originally announced July 2019.

    Comments: Paper presented at the 35th International Conference on Logic Programming (ICLP 2019), Las Cruces, New Mexico, USA, 20-25 September 2019, 16 pages

  7. arXiv:1812.01250  [pdf, other]

    cs.CL cs.LG

    Quantification and Analysis of Scientific Language Variation Across Research Fields

    Authors: Pei Zhou, Muhao Chen, Kai-Wei Chang, Carlo Zaniolo

    Abstract: Quantifying differences in terminologies from various academic domains has been a longstanding problem yet to be solved. We propose a computational approach for analyzing linguistic variation among scientific research fields by capturing the semantic change of terms based on a neural language model. The model is trained on a large collection of literature in five computer science research fields,…

    Submitted 4 December, 2018; originally announced December 2018.

    Comments: Accepted in ICDM Workshop on Cross-disciplinary Data Exchange and Collaboration (CDEC). 2018. 5 pages, 2 figures

  8. arXiv:1811.10667  [pdf, ps, other]

    cs.AI cs.CL

    Embedding Uncertain Knowledge Graphs

    Authors: Xuelu Chen, Muhao Chen, Weijia Shi, Yizhou Sun, Carlo Zaniolo

    Abstract: Embedding models for deterministic Knowledge Graphs (KG) have been extensively studied, with the purpose of capturing latent semantic relations between entities and incorporating the structured knowledge into machine learning. However, there are many KGs that model uncertain knowledge, which typically model the inherent uncertainty of relations facts with a confidence score, and embedding such unc…

    Submitted 25 February, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 33. 2019

  9. On2Vec: Embedding-based Relation Prediction for Ontology Population

    Authors: Muhao Chen, Yingtao Tian, Xuelu Chen, Zijun Xue, Carlo Zaniolo

    Abstract: Populating ontology graphs represents a long-standing problem for the Semantic Web community. Recent advances in translation-based graph embedding methods for populating instance-level knowledge graphs lead to promising new approaching for the ontology population problem. However, unlike instance-level graphs, the majority of relation facts in ontology graphs come with comprehensive semantic relat…

    Submitted 7 September, 2018; originally announced September 2018.

    Comments: SDM-18. 9 pages, 3 figures

  10. arXiv:1808.03726  [pdf, other]

    cs.CL cs.AI cs.LG

    Learning to Represent Bilingual Dictionaries

    Authors: Muhao Chen, Yingtao Tian, Haochen Chen, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo

    Abstract: Bilingual word embeddings have been widely used to capture the similarity of lexical semantics in different human languages. However, many applications, such as cross-lingual semantic search and question answering, can be largely benefited from the cross-lingual correspondence between sentences and lexicons. To bridge this gap, we propose a neural embedding model that leverages bilingual dictionar…

    Submitted 6 September, 2019; v1 submitted 10 August, 2018; originally announced August 2018.

    Comments: CoNLL 2019

  11. arXiv:1807.11689  [pdf, other]

    cs.IR cs.CL cs.HC

    Neural Article Pair Modeling for Wikipedia Sub-article Matching

    Authors: Muhao Chen, Changping Meng, Gang Huang, Carlo Zaniolo

    Abstract: Nowadays, editors tend to separate different subtopics of a long Wikipedia article into multiple sub-articles. This separation seeks to improve human readability. However, it also has a deleterious effect on many Wikipedia-based tasks that rely on the article-as-concept assumption, which requires each entity (or concept) to be described solely by one article. This underlying assumption significan…

    Submitted 4 August, 2018; v1 submitted 31 July, 2018; originally announced July 2018.

    Comments: ECML-PKDD 2018. 16 pages, 4 figures

  12. arXiv:1807.02957  [pdf, other]

    cs.DB cs.LO cs.PL

    Scaling-Up Reasoning and Advanced Analytics on BigData

    Authors: Tyson Condie, Ariyam Das, Matteo Interlandi, Alexander Shkapsky, Mohan Yang, Carlo Zaniolo

    Abstract: BigDatalog is an extension of Datalog that achieves performance and scalability on both Apache Spark and multicore systems to the point that its graph analytics outperform those written in GraphX. Looking back, we see how this realizes the ambitious goal pursued by deductive database researchers beginning forty years ago: this is the goal of combining the rigor and power of logic in expressing que…

    Submitted 9 July, 2018; originally announced July 2018.

    Comments: Under consideration in Theory and Practice of Logic Programming (TPLP)

  13. arXiv:1806.06478  [pdf, other]

    cs.AI cs.CL

    Co-training Embeddings of Knowledge Graphs and Entity Descriptions for Cross-lingual Entity Alignment

    Authors: Muhao Chen, Yingtao Tian, Kai-Wei Chang, Steven Skiena, Carlo Zaniolo

    Abstract: Multilingual knowledge graph (KG) embeddings provide latent semantic representations of entities and structured knowledge with cross-lingual inferences, which benefit various knowledge-driven cross-lingual NLP tasks. However, precisely learning such cross-lingual inferences is usually hindered by the low coverage of entity alignment in many KGs. Since many multilingual KGs also provide literal des…

    Submitted 17 June, 2018; originally announced June 2018.

    Comments: To appear in IJCAI-18

  14. arXiv:1806.00914  [pdf, other]

    cs.IR cs.HC cs.LG stat.ML

    How Much Are You Willing to Share? A "Poker-Styled" Selective Privacy Preserving Framework for Recommender Systems

    Authors: Manoj Reddy Dareddy, Ariyam Das, Junghoo Cho, Carlo Zaniolo

    Abstract: Most industrial recommender systems rely on the popular collaborative filtering (CF) technique for providing personalized recommendations to its users. However, the very nature of CF is adversarial to the idea of user privacy, because users need to share their preferences with others in order to be grouped with like-minded people and receive accurate recommendations. While previous privacy preserv…

    Submitted 3 June, 2018; originally announced June 2018.

  15. arXiv:1707.05681  [pdf, other]

    cs.DB

    Fixpoint Semantics and Optimization of Recursive Datalog Programs with Aggregates

    Authors: Carlo Zaniolo, Mohan Yang, Matteo Interlandi, Ariyam Das, Alexander Shkapsky, Tyson Condie

    Abstract: A very desirable Datalog extension investigated by many researchers in the last thirty years consists in allowing the use of the basic SQL aggregates min, max, count and sum in recursive rules. In this paper, we propose a simple comprehensive solution that extends the declarative least-fixpoint semantics of Horn Clauses, along with the optimization techniques used in the bottom-up implementation a…

    Submitted 21 July, 2017; v1 submitted 18 July, 2017; originally announced July 2017.

    Comments: Paper presented at the 33rd International Conference on Logic Programming (ICLP 2017), Melbourne, Australia, August 28 to September 1, 2017. 16 pages, LaTeX (arXiv:1707.05681)

  16. arXiv:1611.03954  [pdf, other]

    cs.AI cs.CL

    Multilingual Knowledge Graph Embeddings for Cross-lingual Knowledge Alignment

    Authors: Muhao Chen, Yingtao Tian, Mohan Yang, Carlo Zaniolo

    Abstract: Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs. Inasmuch as related knowledge bases are built in several different languages, achieving cross-lingual knowledge alignment will help people in constructing a coherent knowledge base, and assist machines in dealing with different expressions of entity relationships across divers…

    Submitted 17 May, 2017; v1 submitted 11 November, 2016; originally announced November 2016.

    Comments: Extended version of the IJCAI-17 paper

    ACM Class: I.2.4; I.2.6; I.2.7

  17. arXiv:1207.0142  [pdf, other]

    cs.DB

    Early Accurate Results for Advanced Analytics on MapReduce

    Authors: Nikolay Laptev, Kai Zeng, Carlo Zaniolo

    Abstract: Approximate results based on samples often provide the only way in which advanced analytical applications on very massive data sets can satisfy their time and resource constraints. Unfortunately, methods and tools for the computation of accurate early results are currently not supported in MapReduce-oriented systems although these are intended for `big data'. Therefore, we proposed and implemented…

    Submitted 30 June, 2012; originally announced July 2012.

    Comments: VLDB2012

    Journal ref: Proceedings of the VLDB Endowment (PVLDB), Vol. 5, No. 10, pp. 1028-1039 (2012)

  18. arXiv:cs/0702151  [pdf, ps, other]

    cs.DS

    Succinct Sampling on Streams

    Authors: Vladimir Braverman, Rafail Ostrovsky, Carlo Zaniolo

    Abstract: A streaming model is one where data items arrive over long period of time, either one item at a time or in bursts. Typical tasks include computing various statistics over a sliding window of some fixed time-horizon. What makes the streaming model interesting is that as the time progresses, old items expire and new ones arrive. One of the simplest and central tasks in this model is sampling. That…

    Submitted 14 April, 2008; v1 submitted 25 February, 2007; originally announced February 2007.

  19. arXiv:cs/0312041  [pdf, ps, other]

    cs.DB cs.AI

    Greedy Algorithms in Datalog

    Authors: Sergio Greco, Carlo Zaniolo

    Abstract: In the design of algorithms, the greedy paradigm provides a powerful tool for solving efficiently classical computational problems, within the framework of procedural languages. However, expressing these algorithms within the declarative framework of logic-based languages has proven a difficult research challenge. In this paper, we extend the framework of Datalog-like languages to obtain simple…

    Submitted 18 December, 2003; originally announced December 2003.

    Comments: 27 pages

    ACM Class: D.1.6; F.3.1; F.4.1

    Journal ref: Theory and Practice of Logic Programming, 1(4): 381-407, 2001

  20. arXiv:cs/0202001  [pdf, ps, other]

    cs.DB cs.AI

    The Deductive Database System LDL++

    Authors: Faiz Arni, KayLiang Ong, Shalom Tsur, Haixun Wang, Carlo Zaniolo

    Abstract: This paper describes the LDL++ system and the research advances that have enabled its design and development. We begin by discussing the new nonmonotonic and nondeterministic constructs that extend the functionality of the LDL++ language, while preserving its model-theoretic and fixpoint semantics. Then, we describe the execution model and the open architecture designed to support these new cons…

    Submitted 1 February, 2002; originally announced February 2002.

    ACM Class: D.3.2