\subsection{Coarse Estimation}

We review the one-dimensional, sample-level private coarse estimator given in KSU’20, the Pure DP Range Estimator $\textrm{PDPRE}_{\eps,R,k,u}(X)$:

1. If $u\geq 2R$, return any point in $[-R,R]$. Otherwise, set parameter $r\leftarrow u/2$.

2. Divide $[-R-2r,R+2r]$ into buckets: $[-R-2r,-R),\ldots,[-2r,0),[0,2r),\ldots,[R,R+2r]$.

3. Run Pure DP Histogram for $X$ over the above buckets.

4. Let $[a,b]$ be the bucket that has the maximum number of points.

5. Return $\mu_{\texttt{coarse}}=\frac{a+b}{2}$.
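As an illustration only, the five steps above can be sketched in Python. The sketch makes several assumptions that are not taken from KSU’20: the Pure DP Histogram is instantiated by adding independent Laplace noise of scale $2/\eps$ to every bucket count, the bucket grid is aligned at $0$, samples are clipped into the bucket range before counting, and step 1 returns $0$ as its arbitrary point. The function name \texttt{pdpre} and the \texttt{rng} argument are hypothetical.

```python
import numpy as np


def pdpre(x, eps, R, u, rng=None):
    """Illustrative sketch of the coarse range estimator (not a reference implementation)."""
    rng = np.random.default_rng() if rng is None else rng

    # Step 1: if u >= 2R, any point of [-R, R] is within u of the mean.
    if u >= 2 * R:
        return 0.0  # assumption: 0 is used as the arbitrary point
    r = u / 2.0

    # Step 2: buckets of width 2r covering [-R - 2r, R + 2r].
    # Assumption: the grid is aligned at 0; if R is not a multiple of 2r,
    # the grid extends slightly past R + 2r.
    width = 2.0 * r
    j_max = int(np.ceil(R / width)) + 1
    edges = width * np.arange(-j_max, j_max + 1)

    # Assumption: samples are clipped into the bucket range. The clipping
    # bounds do not depend on the data.
    x = np.clip(np.asarray(x, dtype=float), edges[0], edges[-1])

    # Step 3: histogram with Laplace noise on every bucket count.
    # Assumption: scale 2/eps, i.e. sensitivity 2 when one sample is replaced.
    counts, _ = np.histogram(x, bins=edges)
    noisy = counts + rng.laplace(scale=2.0 / eps, size=counts.shape)

    # Steps 4 and 5: midpoint of the bucket with the largest noisy count.
    i = int(np.argmax(noisy))
    return 0.5 * (edges[i] + edges[i + 1])
```

For example, calling \texttt{pdpre(x, eps=1.0, R=10.0, u=1.0)} on a few thousand samples concentrated around $3$ returns the midpoint of a bucket of width $1$ adjacent to $3$, which is within $u=1$ of the mean.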

{theorem}

[Sample-Level Coarse Estimation] For all $\eps>0$, PDPRE is $\eps$-DP. Furthermore, suppose $P$ is a distribution over $\R$ with mean $\mu\in[-R,R]$ and $k$-th moment bounded by $1$. Then there exists a small constant $c$ {\color{blue}(that gets smaller as $k$ gets bigger…)} such that, for all $u\geq c$, there exists

\[
n_{0}=O\Paren{\frac{\log(R/(u\beta))}{\eps}+\log(1/\beta)}
\]

such that, if $n\geq n_{0}$, then with probability at least $1-\beta$,

\[
|\mu-\mu_{\texttt{coarse}}|\leq u.
\]

{proof}

[Proof] Privacy follows directly from {\color{blue}cite}, so the rest of this proof establishes accuracy. If $u\geq 2R$, then by step $1$ the algorithm returns a point of $[-R,R]$, which is within $2R\leq u$ of $\mu\in[-R,R]$. Otherwise, we show that the bucket $[a,b]$ selected by the algorithm intersects $[\mu-r,\mu+r]$. Since $[a,b]$ has width $2r$, its midpoint $\mu_{\texttt{coarse}}$ is within $r$ of every point of the bucket, and hence $|\mu_{\texttt{coarse}}-\mu|\leq r+r=2r=u$.

We first show that, with probability at least $1-\beta/2$, the heaviest non-noisy bucket in $[-R-2r,R+2r]$ (i.e., the bucket with the most samples) intersects $[\mu-r,\mu+r]$. If the noisy bucket $[a,b]$ selected by the algorithm is also the heaviest non-noisy bucket, this immediately gives the claim. It suffices to show that at most $n/16$ samples fall outside $[\mu-r,\mu+r]$. In that event, every bucket that does not intersect $[\mu-r,\mu+r]$ holds at most $n/16$ samples. On the other hand, the interval $[\mu-r,\mu+r]$ has width $2r$ and so meets at most three buckets, the heaviest of which holds at least $(15n/16)/3=5n/16\geq n/4$ samples.

We begin by bounding the expected number of samples that fall outside the interval $[\mu-r,\mu+r]$. Setting $Y_{i}=\mathbb{I}(X_{i}\notin[\mu-r,\mu+r])$, this amounts to bounding $\E[\sum Y_{i}]$:

\[
\E\Brac{\sum_{i=1}^{n}Y_{i}}=n\Pr[|X_{i}-\mu|\geq r]\leq n/r^{k},
\]

where the last inequality follows from the bounded $k$-th moment assumption and Markov’s inequality. Thus, with probability at most $\beta/2$, at least $n/16$ samples fall outside $[\mu-r,\mu+r]$:

\[
\Pr\Brac{\sum_{i=1}^{n}Y_{i}\geq\frac{n}{16}}\leq\Pr\Brac{\sum_{i=1}^{n}Y_{i}\geq\frac{r^{k}}{16}\cdot\frac{n}{r^{k}}}\leq\exp\Paren{-\Theta(n)}\leq\beta/2,
\]

by a Chernoff bound, so long as $2r=u\geq c$ for some constant $c$ (chosen so that $r^{k}/16$ is bounded away from $1$) and $n\geq\Theta(\ln(1/\beta))$.

Finally, we show that, with probability at least $1-\beta/2$, adding noise does not change the conclusion. By Lemma~({\color{blue}cite}), with probability at least $1-\beta/2$, the magnitude of the noise in every bucket is at most $n/16$, so long as $n\geq\Theta\Paren{\ln(R/(r\beta))/\eps}$. In that case, every bucket that does not intersect $[\mu-r,\mu+r]$ has noisy count at most $n/16+n/16=n/8$, while the heaviest non-noisy bucket has noisy count at least $n/4-n/16=3n/16$. Hence the bucket with the largest noisy count still intersects $[\mu-r,\mu+r]$, and a union bound over the two failure events completes the proof.

{corollary}

[User-Level Coarse Estimation] Let $P$ be a distribution over $\R$ with mean $\mu\in[-R,R]$ and $k$-th moment bounded by $1$. Then for all $\eps>0$, there exists an $\eps$-DP user-level algorithm that takes $n\geq n_{0}$ users, where

\[
n_{0}=\cO_{k}\Paren{\frac{\log(Rm/\beta)}{\eps}}
\]

and outputs $\mu_{\texttt{coarse}}$ such that

\[
\Abs{\mu_{\texttt{coarse}}-\mu}\leq\Theta\Paren{\sqrt{\frac{1}{m}}},
\]

where $m$ is the number of samples per user.

{proof}

\Cref{thm:sample-level-coarse-esitmation}, together with \Cref{thm:user-level-to-sample-level-reduction} and setting the accuracy parameter to $\Theta\Paren{\sqrt{\frac{1}{m}}}$, readily proves this corollary.

{remark}

This accuracy suffices for the fine estimation setting, since both values that the clipping radius takes are larger than $\sqrt{\frac{1}{m}}$. A coarse estimate with accuracy $\sqrt{\frac{1}{m}}$ is therefore sufficient.