Skip to main content

Showing 1–2 of 2 results for author: Tolochinsky, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:1805.07697  [pdf

    cs.CL

    The UN Parallel Corpus Annotated for Translation Direction

    Authors: Elad Tolochinsky, Ohad Mosafi, Ella Rabinovich, Shuly Wintner

    Abstract: This work distinguishes between translated and original text in the UN protocol corpus. By modeling the problem as classification problem, we can achieve up to 95% classification accuracy. We begin by deriving a parallel corpus for different language-pairs annotated for translation direction, and then classify the data by using various feature extraction methods. We compare the different methods a… ▽ More

    Submitted 19 May, 2018; originally announced May 2018.

  2. arXiv:1802.07382  [pdf, other

    cs.LG cs.DS

    Generic Coreset for Scalable Learning of Monotonic Kernels: Logistic Regression, Sigmoid and more

    Authors: Elad Tolochinsky, Ibrahim Jubran, Dan Feldman

    Abstract: Coreset (or core-set) is a small weighted \emph{subset} $Q$ of an input set $P$ with respect to a given \emph{monotonic} function $f:\mathbb{R}\to\mathbb{R}$ that \emph{provably} approximates its fitting loss $\sum_{p\in P}f(p\cdot x)$ to \emph{any} given $x\in\mathbb{R}^d$. Using $Q$ we can obtain approximation of $x^*$ that minimizes this loss, by running \emph{existing} optimization algorithms… ▽ More

    Submitted 23 December, 2021; v1 submitted 20 February, 2018; originally announced February 2018.