Skip to main content

Showing 1–38 of 38 results for author: Knudsen, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.13097  [pdf, other

    eess.IV cs.CV cs.LG q-bio.QM

    DISC: Latent Diffusion Models with Self-Distillation from Separated Conditions for Prostate Cancer Grading

    Authors: Man M. Ho, Elham Ghelichkhan, Yosep Chong, Yufei Zhou, Beatrice Knudsen, Tolga Tasdizen

    Abstract: Latent Diffusion Models (LDMs) can generate high-fidelity images from noise, offering a promising approach for augmenting histopathology images for training cancer grading models. While previous works successfully generated high-fidelity histopathology images using LDMs, the generation of image tiles to improve prostate cancer grading has not yet been explored. Additionally, LDMs face challenges i… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Abstract accepted for ISBI 2024. Extended version to be presented at SynData4CV @ CVPR 2024. See more at https://minhmanho.github.io/disc/

  2. arXiv:2404.12650  [pdf, other

    eess.IV cs.CV cs.LG

    F2FLDM: Latent Diffusion Models with Histopathology Pre-Trained Embeddings for Unpaired Frozen Section to FFPE Translation

    Authors: Man M. Ho, Shikha Dubey, Yosep Chong, Beatrice Knudsen, Tolga Tasdizen

    Abstract: The Frozen Section (FS) technique is a rapid and efficient method, taking only 15-30 minutes to prepare slides for pathologists' evaluation during surgery, enabling immediate decisions on further surgical interventions. However, FS process often introduces artifacts and distortions like folds and ice-crystal effects. In contrast, these artifacts and distortions are absent in the higher-quality for… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: Preprint. Our work is available at https://minhmanho.github.io/f2f_ldm/

  3. arXiv:2403.11340  [pdf, other

    eess.IV cs.CV

    StainDiffuser: MultiTask Dual Diffusion Model for Virtual Staining

    Authors: Tushar Kataria, Beatrice Knudsen, Shireen Y. Elhabian

    Abstract: Hematoxylin and Eosin (H&E) staining is the most commonly used for disease diagnosis and tumor recurrence tracking. Hematoxylin excels at highlighting nuclei, whereas eosin stains the cytoplasm. However, H&E stain lacks details for differentiating different types of cells relevant to identifying the grade of the disease or response to specific treatment variations. Pathologists require special imm… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

  4. arXiv:2312.06978  [pdf, other

    cs.CV

    CLASS-M: Adaptive stain separation-based contrastive learning with pseudo-labeling for histopathological image classification

    Authors: Bodong Zhang, Hamid Manoochehri, Man Minh Ho, Fahimeh Fooladgar, Yosep Chong, Beatrice S. Knudsen, Deepika Sirohi, Tolga Tasdizen

    Abstract: Histopathological image classification is an important task in medical image analysis. Recent approaches generally rely on weakly supervised learning due to the ease of acquiring case-level labels from pathology reports. However, patch-level classification is preferable in applications where only a limited number of cases are available or when local prediction accuracy is critical. On the other ha… ▽ More

    Submitted 4 January, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  5. arXiv:2308.13182  [pdf, other

    cs.CV cs.AI cs.LG q-bio.QM

    Structural Cycle GAN for Virtual Immunohistochemistry Staining of Gland Markers in the Colon

    Authors: Shikha Dubey, Tushar Kataria, Beatrice Knudsen, Shireen Y. Elhabian

    Abstract: With the advent of digital scanners and deep learning, diagnostic operations may move from a microscope to a desktop. Hematoxylin and Eosin (H&E) staining is one of the most frequently used stains for disease analysis, diagnosis, and grading, but pathologists do need different immunohistochemical (IHC) stains to analyze specific structures or cells. Obtaining all of these stains (H&E and different… ▽ More

    Submitted 25 August, 2023; originally announced August 2023.

    Comments: Accepted to MICCAI Workshop 2023

  6. arXiv:2307.03275  [pdf, other

    cs.CV

    To pretrain or not to pretrain? A case study of domain-specific pretraining for semantic segmentation in histopathology

    Authors: Tushar Kataria, Beatrice Knudsen, Shireen Elhabian

    Abstract: Annotating medical imaging datasets is costly, so fine-tuning (or transfer learning) is the most effective method for digital pathology vision applications such as disease classification and semantic segmentation. However, due to texture bias in models trained on real-world images, transfer learning for histopathology applications might result in underperforming models, which necessitates the need… ▽ More

    Submitted 21 August, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

  7. arXiv:2305.05789  [pdf, other

    cs.CV

    Unsupervised Domain Adaptation for Medical Image Segmentation via Feature-space Density Matching

    Authors: Tushar Kataria, Beatrice Knudsen, Shireen Elhabian

    Abstract: Semantic segmentation is a critical step in automated image interpretation and analysis where pixels are classified into one or more predefined semantically meaningful classes. Deep learning approaches for semantic segmentation rely on harnessing the power of annotated images to learn features indicative of these semantic classes. Nonetheless, they often fail to generalize when there is a signific… ▽ More

    Submitted 6 July, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  8. arXiv:2209.13408  [pdf, other

    eess.IV cs.CV cs.LG

    A Pathologist-Informed Workflow for Classification of Prostate Glands in Histopathology

    Authors: Alessandro Ferrero, Beatrice Knudsen, Deepika Sirohi, Ross Whitaker

    Abstract: Pathologists diagnose and grade prostate cancer by examining tissue from needle biopsies on glass slides. The cancer's severity and risk of metastasis are determined by the Gleason grade, a score based on the organization and morphology of prostate cancer glands. For diagnostic work-up, pathologists first locate glands in the whole biopsy core, and -- if they detect cancer -- they assign a Gleason… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Published as a workshop paper at MICCAI MOVI 2022

    Journal ref: First International Workshop, MOVI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 18, 2022, Proceedings, https://link.springer.com/book/10.1007/978-3-031-16961-8

  9. arXiv:2206.12505  [pdf, other

    cs.CV

    Stain Based Contrastive Co-training for Histopathological Image Analysis

    Authors: Bodong Zhang, Beatrice Knudsen, Deepika Sirohi, Alessandro Ferrero, Tolga Tasdizen

    Abstract: We propose a novel semi-supervised learning approach for classification of histopathology images. We employ strong supervision with patch-level annotations combined with a novel co-training loss to create a semi-supervised learning framework. Co-training relies on multiple conditionally independent and sufficient views of the data. We separate the hematoxylin and eosin channels in pathology images… ▽ More

    Submitted 26 August, 2022; v1 submitted 24 June, 2022; originally announced June 2022.

  10. Visual attention analysis of pathologists examining whole slide images of Prostate cancer

    Authors: Souradeep Chakraborty, Ke Ma, Rajarsi Gupta, Beatrice Knudsen, Gregory J. Zelinsky, Joel H. Saltz, Dimitris Samaras

    Abstract: We study the attention of pathologists as they examine whole-slide images (WSIs) of prostate cancer tissue using a digital microscope. To the best of our knowledge, our study is the first to report in detail how pathologists navigate WSIs of prostate cancer as they accumulate information for their diagnoses. We collected slide navigation data (i.e., viewport location, magnification level, and time… ▽ More

    Submitted 2 May, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: ISBI 2022 (Oral presentation)

  11. arXiv:2104.05093  [pdf, other

    cs.DS

    Load Balancing with Dynamic Set of Balls and Bins

    Authors: Anders Aamand, Jakob Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: In dynamic load balancing, we wish to distribute balls into bins in an environment where both balls and bins can be added and removed. We want to minimize the maximum load of any bin but we also want to minimize the number of balls and bins affected when adding or removing a ball or a bin. We want a hashing-style solution where we given the ID of a ball can find its bin efficiently. We are given… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

    Comments: Accepted at STOC'21

  12. arXiv:2008.08654  [pdf, ps, other

    cs.DS

    The Power of Hashing with Mersenne Primes

    Authors: Thomas Dybdahl Ahle, Jakob Tejs Bæk Knudsen, Mikkel Thorup

    Abstract: The classic way of computing a $k$-universal hash function is to use a random degree-$(k-1)$ polynomial over a prime field $\mathbb Z_p$. For a fast computation of the polynomial, the prime $p$ is often chosen as a Mersenne prime $p=2^b-1$. In this paper, we show that there are other nice advantages to using Mersenne primes. Our view is that the hash function's output is a $b$-bit integer that i… ▽ More

    Submitted 6 May, 2021; v1 submitted 19 August, 2020; originally announced August 2020.

  13. arXiv:2004.01156  [pdf, other

    cs.DS

    No Repetition: Fast Streaming with Highly Concentrated Hashing

    Authors: Anders Aamand, Debarati Das, Evangelos Kipouridis, Jakob B. T. Knudsen, Peter M. R. Rasmussen, Mikkel Thorup

    Abstract: To get estimators that work within a certain error bound with high probability, a common strategy is to design one that works with constant probability, and then boost the probability using independent repetitions. Important examples of this approach are small space algorithms for estimating the number of distinct elements in a stream, or estimating the set similarity between large sets. Using sta… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: 10 pages

  14. arXiv:1909.01821  [pdf, ps, other

    cs.DS cs.LG math.PR stat.ML

    Almost Optimal Tensor Sketch

    Authors: Thomas D. Ahle, Jakob B. T. Knudsen

    Abstract: We construct a matrix $M\in R^{m\otimes d^c}$ with just $m=O(c\,λ\,\varepsilon^{-2}\text{poly}\log1/\varepsilonδ)$ rows, which preserves the norm $\|Mx\|_2=(1\pm\varepsilon)\|x\|_2$ of all $x$ in any given $λ$ dimensional subspace of $ R^d$ with probability at least $1-δ$. This matrix can be applied to tensors $x^{(1)}\otimes\dots\otimes x^{(c)}\in R^{d^c}$ in $O(c\, m \min\{d,m\})$ time -- hence… ▽ More

    Submitted 3 September, 2019; originally announced September 2019.

  15. arXiv:1909.01410  [pdf, ps, other

    cs.DS

    Oblivious Sketching of High-Degree Polynomial Kernels

    Authors: Thomas D. Ahle, Michael Kapralov, Jakob B. T. Knudsen, Rasmus Pagh, Ameya Velingker, David Woodruff, Amir Zandieh

    Abstract: Kernel methods are fundamental tools in machine learning that allow detection of non-linear dependencies between data without explicitly constructing feature vectors in high dimensional spaces. A major disadvantage of kernel methods is their poor scalability: primitives such as kernel PCA or kernel ridge regression generally take prohibitively large quadratic space and (at least) quadratic time, a… ▽ More

    Submitted 22 December, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

  16. arXiv:1905.13208  [pdf, other

    cs.CV eess.IV

    An attention-based multi-resolution model for prostate whole slide imageclassification and localization

    Authors: Jiayun Li, Wenyuan Li, Arkadiusz Gertych, Beatrice S. Knudsen, William Speier, Corey W. Arnold

    Abstract: Histology review is often used as the `gold standard' for disease diagnosis. Computer aided diagnosis tools can potentially help improve current pathology workflows by reducing examination time and interobserver variability. Previous work in cancer grading has focused mainly on classifying pre-defined regions of interest (ROIs), or relied on large amounts of fine-grained labels. In this paper, we… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 8 pages, 4 figures, CVPR 2019 Towards Causal, Explainable and Universal Medical Visual Diagnosis (MVD) Workshop

  17. arXiv:1905.00369  [pdf, other

    cs.DS

    Fast hashing with Strong Concentration Bounds

    Authors: Anders Aamand, Jakob B. T. Knudsen, Mathias B. T. Knudsen, Peter M. R. Rasmussen, Mikkel Thorup

    Abstract: Previous work on tabulation hashing by Patrascu and Thorup from STOC'11 on simple tabulation and from SODA'13 on twisted tabulation offered Chernoff-style concentration bounds on hash based sums, e.g., the number of balls/keys hashing to a given bin, but under some quite severe restrictions on the expected values of these sums. The basic idea in tabulation hashing is to view a key as consisting of… ▽ More

    Submitted 10 August, 2020; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 54 pages, 3 figures. An extended abstract appeared at the 52nd Annual ACM Symposium on Theory of Computing (STOC20)

  18. arXiv:1904.04045  [pdf, other

    cs.DS cs.CG cs.DB

    Subsets and Supermajorities: Optimal Hashing-based Set Similarity Search

    Authors: Thomas Dybdahl Ahle, Jakob Bæk Tejs Knudsen

    Abstract: We formulate and optimally solve a new generalized Set Similarity Search problem, which assumes the size of the database and query sets are known in advance. By creating polylog copies of our data-structure, we optimally solve any symmetric Approximate Set Similarity Search problem, including approximate versions of Subset Search, Maximum Inner Product Search (MIPS), Jaccard Similarity Search and… ▽ More

    Submitted 20 April, 2020; v1 submitted 8 April, 2019; originally announced April 2019.

    MSC Class: 68W01

  19. arXiv:1902.01732  [pdf, other

    cs.CG

    Classifying Convex Bodies by their Contact and Intersection Graphs

    Authors: Anders Aamand, Mikkel Abrahamsen, Jakob Bæk Tejs Knudsen, Peter Michael Reichstein Rasmussen

    Abstract: Suppose that $A$ is a convex body in the plane and that $A_1,\dots,A_n$ are translates of $A$. Such translates give rise to an intersection graph of $A$, $G=(V,E)$, with vertices $V=\{1,\dots,n\}$ and edges $E=\{uv\mid A_u\cap A_v\neq \emptyset\}$. The subgraph $G'=(V, E')$ satisfying that $E'\subset E$ is the set of edges $uv$ for which the interiors of $A_u$ and $A_v$ are disjoint is a unit dist… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

    Comments: 19 pages, 7 figures

    MSC Class: 52C05

  20. arXiv:1804.09684  [pdf, other

    cs.DS

    Power of $d$ Choices with Simple Tabulation

    Authors: Anders Aamand, Mathias Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: Suppose that we are to place $m$ balls into $n$ bins sequentially using the $d$-choice paradigm: For each ball we are given a choice of $d$ bins, according to $d$ hash functions $h_1,\dots,h_d$ and we place the ball in the least loaded of these bins breaking ties arbitrarily. Our interest is in the number of balls in the fullest bin after all $m$ balls have been placed. Azar et al. [STOC'94] pro… ▽ More

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: Accepted at ICALP 2018

  21. arXiv:1711.08797  [pdf, other

    stat.ML cs.DS cs.LG

    Practical Hash Functions for Similarity Estimation and Dimensionality Reduction

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: Hashing is a basic tool for dimensionality reduction employed in several aspects of machine learning. However, the perfomance analysis is often carried out under the abstract assumption that a truly random unit cost hash function is used, without concern for which concrete hash function is employed. The concrete hash function may work fine on sufficiently random input. The question is if it can be… ▽ More

    Submitted 23 November, 2017; originally announced November 2017.

    Comments: Preliminary version of this paper will appear at NIPS 2017

  22. arXiv:1706.02783  [pdf, other

    cs.DS

    Linear Hashing is Awesome

    Authors: Mathias Bæk Tejs Knudsen

    Abstract: We consider the hash function $h(x) = ((ax+b) \bmod p) \bmod n$ where $a,b$ are chosen uniformly at random from $\{0,1,\ldots,p-1\}$. We prove that when we use $h(x)$ in hashing with chaining to insert $n$ elements into a table of size $n$ the expected length of the longest chain is $\tilde{O}\!\left(n^{1/3}\right)$. The proof also generalises to give the same bound when we use the multiply-shift… ▽ More

    Submitted 8 June, 2017; originally announced June 2017.

    Comments: A preliminary version appeared at FOCS'16

  23. arXiv:1704.04509  [pdf, other

    cs.DS

    The Entropy of Backwards Analysis

    Authors: Mathias Bæk Tejs Knudsen, Mikkel Thorup

    Abstract: Backwards analysis, first popularized by Seidel, is often the simplest most elegant way of analyzing a randomized algorithm. It applies to incremental algorithms where elements are added incrementally, following some random permutation, e.g., incremental Delauney triangulation of a pointset, where points are added one by one, and where we always maintain the Delauney triangulation of the points ad… ▽ More

    Submitted 14 April, 2017; originally announced April 2017.

  24. arXiv:1704.04473  [pdf, other

    cs.DS

    Additive Spanners and Distance Oracles in Quadratic Time

    Authors: Mathias Bæk Tejs Knudsen

    Abstract: Let $G$ be an unweighted, undirected graph. An additive $k$-spanner of $G$ is a subgraph $H$ that approximates all distances between pairs of nodes up to an additive error of $+k$, that is, it satisfies $d_H(u,v) \le d_G(u,v)+k$ for all nodes $u,v$, where $d$ is the shortest path distance. We give a deterministic algorithm that constructs an additive $O\!\left(1\right)$-spanner with… ▽ More

    Submitted 14 April, 2017; originally announced April 2017.

  25. Maximal Unbordered Factors of Random Strings

    Authors: Patrick Hagge Cording, Travis Gagie, Mathias Bæk Tejs Knudsen, Tomasz Kociumaka

    Abstract: A border of a string is a non-empty prefix of the string that is also a suffix of the string, and a string is unbordered if it has no border other than itself. Loptev, Kucherov, and Starikovskaya [CPM 2015] conjectured the following: If we pick a string of length $n$ from a fixed non-unary alphabet uniformly at random, then the expected maximum length of its unbordered factors is $n - O(1)$. We co… ▽ More

    Submitted 17 December, 2018; v1 submitted 14 April, 2017; originally announced April 2017.

    Comments: A preliminary version with weaker results was presented at the 23rd Symposium on String Processing and Information Retrieval (SPIRE '16)

  26. arXiv:1704.02178  [pdf, other

    cs.DS

    New Subquadratic Approximation Algorithms for the Girth

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: We consider the problem of approximating the girth, $g$, of an unweighted and undirected graph $G=(V,E)$ with $n$ nodes and $m$ edges. A seminal result of Itai and Rodeh [SICOMP'78] gave an additive $1$-approximation in $O(n^2)$ time, and the main open question is thus how well we can do in subquadratic time. In this paper we present two main results. The first is a $(1+\varepsilon,O(1))$-approx… ▽ More

    Submitted 7 April, 2017; originally announced April 2017.

  27. arXiv:1703.10380  [pdf, other

    cs.DS

    Finding Even Cycles Faster via Capped k-Walks

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: In this paper, we consider the problem of finding a cycle of length $2k$ (a $C_{2k}$) in an undirected graph $G$ with $n$ nodes and $m$ edges for constant $k\ge2$. A classic result by Bondy and Simonovits [J.Comb.Th.'74] implies that if $m \ge100k n^{1+1/k}$, then $G$ contains a $C_{2k}$, further implying that one needs to consider only graphs with $m = O(n^{1+1/k})$. Previously the best known a… ▽ More

    Submitted 30 March, 2017; originally announced March 2017.

    Comments: To appear at STOC'17

  28. arXiv:1607.04911  [pdf, other

    cs.DS math.CO

    Near-Optimal Induced Universal Graphs for Bounded Degree Graphs

    Authors: Mikkel Abrahamsen, Stephen Alstrup, Jacob Holm, Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: A graph $U$ is an induced universal graph for a family $F$ of graphs if every graph in $F$ is a vertex-induced subgraph of $U$. For the family of all undirected graphs on $n$ vertices Alstrup, Kaplan, Thorup, and Zwick [STOC 2015] give an induced universal graph with $O\!\left(2^{n/2}\right)$ vertices, matching a lower bound by Moon [Proc. Glasgow Math. Assoc. 1965]. Let $k= \lceil D/2 \rceil$.… ▽ More

    Submitted 21 July, 2016; v1 submitted 17 July, 2016; originally announced July 2016.

  29. arXiv:1507.02618  [pdf, other

    cs.DS

    Sublinear Distance Labeling

    Authors: Stephen Alstrup, Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Ely Porat

    Abstract: A distance labeling scheme labels the $n$ nodes of a graph with binary strings such that, given the labels of any two nodes, one can determine the distance in the graph between the two nodes by looking only at the labels. A $D$-preserving distance labeling scheme only returns precise distances between pairs of nodes that are at distance at least $D$ from each other. In this paper we consider dista… ▽ More

    Submitted 8 September, 2016; v1 submitted 9 July, 2015; originally announced July 2015.

    Comments: A preliminary version of this paper appeared at ESA'16

  30. arXiv:1504.02671  [pdf, ps, other

    cs.DS

    Longest Common Extensions in Sublinear Space

    Authors: Philip Bille, Inge Li Gørtz, Mathias Bæk Tejs Knudsen, Moshe Lewenstein, Hjalte Wedel Vildhøj

    Abstract: The longest common extension problem (LCE problem) is to construct a data structure for an input string $T$ of length $n$ that supports LCE$(i,j)$ queries. Such a query returns the length of the longest common prefix of the suffixes starting at positions $i$ and $j$ in $T$. This classic problem has a well-known solution that uses $O(n)$ space and $O(1)$ query time. In this paper we show that for a… ▽ More

    Submitted 10 April, 2015; originally announced April 2015.

    Comments: An extended abstract of this paper has been accepted to CPM 2015

  31. arXiv:1504.02306  [pdf, other

    cs.DS

    Optimal induced universal graphs and adjacency labeling for trees

    Authors: Stephen Alstrup, Søren Dahlgaard, Mathias Bæk Tejs Knudsen

    Abstract: We show that there exists a graph $G$ with $O(n)$ nodes, where any forest of $n$ nodes is a node-induced subgraph of $G$. Furthermore, for constant arboricity $k$, the result implies the existence of a graph with $O(n^k)$ nodes that contains all $n$-node graphs as node-induced subgraphs, matching a $Ω(n^k)$ lower bound. The lower bound and previously best upper bounds were presented in Alstrup and… ▽ More

    Submitted 15 February, 2016; v1 submitted 9 April, 2015; originally announced April 2015.

    Comments: A preliminary version of this paper appeared at FOCS'15

  32. arXiv:1502.05729  [pdf, other

    cs.DS

    Quicksort, Largest Bucket, and Min-Wise Hashing with Limited Independence

    Authors: Mathias Bæk Tejs Knudsen, Morten Stöckel

    Abstract: Randomized algorithms and data structures are often analyzed under the assumption of access to a perfect source of randomness. The most fundamental metric used to measure how "random" a hash function or a random number generator is, is its independence: a sequence of random variables is said to be $k$-independent if every variable is uniform and every size $k$ subset is independent. In this paper… ▽ More

    Submitted 19 February, 2015; originally announced February 2015.

    Comments: Submitted to ICALP 2015

  33. arXiv:1411.7191  [pdf, ps, other

    cs.DS

    Hashing for statistics over k-partitions

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Eva Rotenberg, Mikkel Thorup

    Abstract: In this paper we analyze a hash function for $k$-partitioning a set into bins, obtaining strong concentration bounds for standard algorithms combining statistics from each bin. This generic method was originally introduced by Flajolet and Martin~[FOCS'83] in order to save a factor $Ω(k)$ of time per element over $k$ independent samples when estimating the number of distinct elements in a data st… ▽ More

    Submitted 15 February, 2016; v1 submitted 26 November, 2014; originally announced November 2014.

    Comments: Appear at FOCS'15

  34. arXiv:1407.6846  [pdf, other

    cs.DS

    The Power of Two Choices with Simple Tabulation

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Eva Rotenberg, Mikkel Thorup

    Abstract: The power of two choices is a classic paradigm for load balancing when assigning $m$ balls to $n$ bins. When placing a ball, we pick two bins according to two hash functions $h_0$ and $h_1$, and place the ball in the least loaded bin. Assuming fully random hash functions, when $m=O(n)$, Azar et al.~[STOC'94] proved that the maximum load is $\lg \lg n + O(1)$ with high probability. In this paper,… ▽ More

    Submitted 25 January, 2016; v1 submitted 25 July, 2014; originally announced July 2014.

    Comments: SODA'16

  35. arXiv:1407.5011  [pdf, other

    cs.DS

    A simple and optimal ancestry labeling scheme for trees

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Noy Rotbart

    Abstract: We present a $\lg n + 2 \lg \lg n+3$ ancestry labeling scheme for trees. The problem was first presented by Kannan et al. [STOC 88'] along with a simple $2 \lg n$ solution. Motivated by applications to XML files, the label size was improved incrementally over the course of more than 20 years by a series of papers. The last, due to Fraigniaud and Korman [STOC 10'], presented an asymptotically optim… ▽ More

    Submitted 26 April, 2015; v1 submitted 18 July, 2014; originally announced July 2014.

    Comments: 12 pages, 1 figure. To appear at ICALP'15

  36. arXiv:1404.4982  [pdf, other

    cs.DS cs.DC

    Dynamic and Multi-functional Labeling Schemes

    Authors: Søren Dahlgaard, Mathias Bæk Tejs Knudsen, Noy Rotbart

    Abstract: We investigate labeling schemes supporting adjacency, ancestry, sibling, and connectivity queries in forests. In the course of more than 20 years, the existence of $\log n + O(\log \log)$ labeling schemes supporting each of these functions was proven, with the most recent being ancestry [Fraigniaud and Korman, STOC '10]. Several multi-functional labeling schemes also enjoy lower or upper bounds of… ▽ More

    Submitted 19 April, 2014; originally announced April 2014.

    Comments: 17 pages, 5 figures

  37. arXiv:1403.0178  [pdf, other

    cs.DS

    Additive Spanners: A Simple Construction

    Authors: Mathias Bæk Tejs Knudsen

    Abstract: We consider additive spanners of unweighted undirected graphs. Let $G$ be a graph and $H$ a subgraph of $G$. The most naïve way to construct an additive $k$-spanner of $G$ is the following: As long as $H$ is not an additive $k$-spanner repeat: Find a pair $(u,v) \in H$ that violates the spanner-condition and a shortest path from $u$ to $v$ in $G$. Add the edges of this path to $H$. We show that,… ▽ More

    Submitted 23 November, 2014; v1 submitted 2 March, 2014; originally announced March 2014.

    Comments: To appear at proceedings of the 14th Scandinavian Symposium and Workshop on Algorithm Theory (SWAT 2014)

  38. arXiv:1102.0059  [pdf, ps, other

    stat.ME cs.CE cs.CV cs.LG q-bio.QM

    Statistical methods for tissue array images - algorithmic scoring and co-training

    Authors: Donghui Yan, Pei Wang, Michael Linden, Beatrice Knudsen, Timothy Randolph

    Abstract: Recent advances in tissue microarray technology have allowed immunohistochemistry to become a powerful medium-to-high throughput analysis tool, particularly for the validation of diagnostic and prognostic biomarkers. However, as study size grows, the manual evaluation of these assays becomes a prohibitive limitation; it vastly reduces throughput and greatly increases variability and expense. We pr… ▽ More

    Submitted 1 October, 2012; v1 submitted 31 January, 2011; originally announced February 2011.

    Comments: Published in at http://dx.doi.org/10.1214/12-AOAS543 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS543

    Journal ref: Annals of Applied Statistics 2012, Vol. 6, No. 3, 1280-1305