Skip to main content

Showing 1–5 of 5 results for author: Gronau, I

.
  1. arXiv:2201.04687  [pdf, other

    cs.SI cs.DB cs.IR

    CompanyName2Vec: Company Entity Matching Based on Job Ads

    Authors: Ran Ziv, Ilan Gronau, Michael Fire

    Abstract: Entity Matching is an essential part of all real-world systems that take in structured and unstructured data coming from different sources. Typically no common key is available for connecting records. Massive data cleaning and integration processes require completion before any data analytics, or further processing can be performed. Although record linkage is frequently regarded as a somewhat tedi… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  2. arXiv:2108.05848  [pdf, other

    q-bio.BM cs.CE

    Eliminating unwanted patterns with minimal interference

    Authors: Zehavit Leibovich, Ilan Gronau

    Abstract: Artificial synthesis of DNA molecules is an essential part of the study of biological mechanisms. The design of a synthetic DNA molecule usually involves many objectives. One of the important objectives is to eliminate short sequence patterns that correspond to binding sites of restriction enzymes or transcription factors. While many design tools address this problem, no adequate formal solution e… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: This research was done as part of Zehavit Leibovich's dissertation for an M.Sc degree in Computer Science. Relevant code available at https://github.com/zehavitc/EliminatingDNAPatterns.git

  3. arXiv:1306.5110  [pdf, other

    q-bio.PE

    Genome-wide inference of ancestral recombination graphs

    Authors: Matthew D. Rasmussen, Melissa J. Hubisz, Ilan Gronau, Adam Siepel

    Abstract: The complex correlation structure of a collection of orthologous DNA sequences is uniquely captured by the "ancestral recombination graph" (ARG), a complete record of coalescence and recombination events in the history of the sample. However, existing methods for ARG inference are computationally intensive, highly approximate, or limited to small numbers of sequences, and, as a consequence, explic… ▽ More

    Submitted 3 December, 2013; v1 submitted 21 June, 2013; originally announced June 2013.

    Comments: 88 pages, 7 main figures, 22 supplementary figures. This version contains a substantially expanded genomic data analysis

  4. arXiv:1305.7390  [pdf

    q-bio.GN

    Genome Sequencing Highlights Genes Under Selection and the Dynamic Early History of Dogs

    Authors: Adam H. Freedman, Rena M. Schweizer, Ilan Gronau, Eunjung Han, Diego Ortega-Del Vecchyo, Pedro M. Silva, Marco Galaverni, Zhenxin Fan, Peter Marx, Belen Lorente-Galdos, Holly Beale, Oscar Ramirez, Farhad Hormozdiari, Can Alkan, Carles VilĂ , Kevin Squire, Eli Geffen, Josip Kusak, Adam R. Boyko, Heidi G. Parker, Clarence Lee, Vasisht Tadigotla, Adam Siepel, Carlos D. Bustamante, Timothy T. Harkins , et al. (5 additional authors not shown)

    Abstract: To identify genetic changes underlying dog domestication and reconstruct their early evolutionary history, we analyzed novel high-quality genome sequences of three gray wolves, one from each of three putative centers of dog domestication, two ancient dog lineages (Basenji and Dingo) and a golden jackal as an outgroup. We find dogs and wolves diverged through a dynamic process involving population… ▽ More

    Submitted 4 June, 2013; v1 submitted 31 May, 2013; originally announced May 2013.

    Comments: 24 pages, 5 figures. To download the Supporting Information file, use the following link: https://www.dropbox.com/s/2yoytspv1iods7s/Freedman_etal_SupportingInfo_arxiv.pdf

  5. Inference of Natural Selection from Interspersed Genomic Elements Based on Polymorphism and Divergence

    Authors: Ilan Gronau, Leonardo Arbiza, Jaaved Mohammed, Adam Siepel

    Abstract: Complete genome sequences contain valuable information about natural selection, but extracting this information for short, widely scattered noncoding elements remains a challenging problem. Here we introduce a new computational method for addressing this problem called Inference of Natural Selection from Interspersed Genomically coHerent elemenTs (INSIGHT). INSIGHT uses a generative probabilistic… ▽ More

    Submitted 23 April, 2013; v1 submitted 28 September, 2011; originally announced September 2011.

    Comments: 21 page manuscript, 4 figure, 4 tables + 3 supp figures + 3 supp tables + supp methods. V4: additional results on human noncoding RNAs annotated by GENCODE + refinement of previous versions + additional supplementary material included to main document. V5: some minor modifications. V6: this is an electronic version of an article published in Mol Biol Evol, 2013