Abstract
We establish a polynomial ergodic theorem for actions of the
affine group of a countable field .
As an application, we deduce–via a variant of Furstenberg’s
correspondence principle–that for fields of characteristic
zero, any “large” set contains
“many” patterns of the form , for every
non-constant polynomial .
Our methods are flexible enough that they allow us to recover
analogous density results in
the setting of finite fields and, with the aid of a
new finitistic variant of Bergelson’s “colouring trick”,
show that for fixed, any colouring of a large
enough finite field will contain monochromatic
patterns of
the form .
In a different direction,
we obtain a double ergodic theorem
for actions of the affine group of a countable field. An
adaptation of the argument for affine actions of finite fields
leads to a generalisation of a theorem of Shkredov. Finally,
to highlight the utility of the aforementioned finitistic
“colouring trick”, we provide a conditional, elementary
generalisation of Green and Sanders’ theorem.
1.1 Historic background
A well-known and still open question of Hindman
(see, for example, [9]) reads as follows.
Question 1.1.
Given any finite colouring of , do there always exists such that is monochromatic, i.e. and all have the same colour?
In [11], Moreira proved the following
result marking significant progress towards
an answer to Question 1.1.
Theorem 1.2 (Moreira).
For any finite colouring of there exist (infinitely many) such that is monochromatic.
Prior to Moreira’s theorem, Shkredov ([12])
addressed its analogue for finite fields of prime order
proving two density results.
Theorem 1.3 (Shkredov).
Let be a finite field of prime order . If are any sets with , then there exist
such that and .
Theorem 1.4 (Shkredov).
Let be a finite field of prime order . If are any sets with , then there exist such that , and .
It follows from Theorem 1.4 that if
is -coloured and is large enough relative to , then there exist
such that is monochromatic. Later, the analogue of
Question 1.1 for finite fields of
prime order was solved by Green and Sanders in [7] via the
following quantitative result.
Theorem 1.5 (Green-Sanders).
Let be fixed and be a finite field of prime
order , with large enough. For any
-colouring of there
are at least monochromatic quadruples ,
where does not depend on .
Observe that Theorems 1.3 and 1.4 are density results, while there
is no density version of the partition regularity
Theorem 1.5. This was pointed out by
Shkredov in [12].
In the context of countable fields, Bowen and Sabok in [4] gave a positive answer to the analogue of Question
1.1. By a compactness
principle they also solved the analogue of this question
for all finite fields as a corollary of their main theorem.
Before that, Bergelson and Moreira in [3] established the
following analogue of Theorem 1.2 using
methods from ergodic theory.
Theorem 1.6 (Bergelson-Moreira).
Let be a countable
field and consider a finite colouring ,
. Then, there exists a colour , ,
and “many” , such that
In this setting, an appropriate notion of
largeness, which guarantees patterns involving both addition and
multiplication in
any large set, turns out to be that of positive upper density
with respect to double Følner sequences. We recall the
definition given in [3].
Definition 1.7.
Let be a countable field. A double Følner sequence in is a sequence of (non-empty) finite subsets which is asymptotically invariant under any fixed affine transformation of , that is,
|
|
|
for any .
This notion of sequence allows us to define asymptotic densities with good properties such as shift invariance. For a countable field and a double Følner sequence in as above, given a set , its upper density with respect to is defined as
|
|
|
Moreover, its lower density with respect to is defined as
|
|
|
and whenever the limit exists we say that has a density
with respect to given by
Using a “colouring trick” Bergelson and Moreira
were able to recover Theorem
1.6 from essentially the following theorem,
which we state vaguely.
Theorem 1.8 (Bergelson-Moreira).
Let be a countable field, be a
double Følner sequence in and with
. Then, there exist “many”
such that .
An advantage of the statement of Theorem 1.8, over that of Theorem 1.6, is that it’s form can be handled with ergodic
theoretic
tools and methods. This is a general
principle, discovered by Furstenberg in his
seminal
proof of Szemerédi’s theorem (see [6]). There
he introduced a correspondence principle, which
often allows one to translate a
problem of finding patterns in large sets (subsets of
the
integers, of semi-groups, of fields, etc.) to a problem
about recurrence in measure preserving systems.
The following ergodic theorem from [3], whose proof
utilizes
the group of affine transformations of a field ,
defined as
, implies Theorem 1.8.
We write for the map , if and for , if .
Theorem 1.9 (Bergelson-Moreira).
Let be a countable field and be a double
Følner sequence in . Let
be a probability space on which we assume
that acts by measure preserving
transformations (m.p.t. for short). Then, given any , we have that
|
|
|
1.2 Main results
A question which occurs naturally is whether we can extend Theorem
1.6, by finding monochromatic patterns of
the form , where is a polynomial over ,
other than . This is addressed by our first
main result (stated somewhat vaguely
for now) which we formulate after an important–throughout this
paper–definition.
Definition 1.10.
Given a field
with prime characteristic , we say that a non-constant polynomial is admissible for , if . If is a countable field with , then any non-constant polynomial is admissible for .
Theorem 1.11.
Let be a countable field and be any
admissible polynomial. Then, for any finite colouring there exists a colour , , and “many” , so that
The density theorem which we will use to prove
Theorem 1.11 is the
following.
Theorem 1.12.
Let be a countable field, be a double Følner sequence in and with . Then, for any admissible polynomial
there exist “many” such that .
In the same spirit as in the end of Section ,
Theorem 1.12 is implied
by an ergodic theorem.
Theorem 1.13.
Let , and be as in the
statement of Theorem 1.12. Let
be a probability space on which we assume
that acts by measure preserving
transformations. Then, given any we have that
|
|
|
where the limit is in and
denotes the orthogonal projection onto the subspace of -invariant functions.
The proof of this statement is based on that of
Bergelson and Moreira’s proof of Theorem
1.9, with additional applications
of van der Corput type of lemmas to facilitate an
induction
argument on the degree of the polynomial. This appears
especially in the proof of the
polynomial mean ergodic theorem of Proposition
3.2.
We also finitise the arguments used to
prove Theorem 1.13 in order to recover the following analogue of
our main density result, Theorem 1.12, in the setting of finite fields.
Theorem 1.14.
Let be a finite field and let
be an admissible polynomial over of degree . Then, if
with , there are , so that and .
In particular, letting , we have the finite field version of
the density statement that there exist such that
, provided
is large enough.
We also produce a new finitistic version of
the “colouring trick” mentioned earlier and with the aid of
Theorem 1.14 recover the next partition
regularity result.
Theorem 1.15.
Let be fixed. Then, there exists with the following property. If
is any finite field with and
and is a polynomial of , then for any
finite colouring , there
is a colour and , such that
A special case of this theorem (when ) is the
partition
regularity corollary of Shkredov’s Theorem
1.4 mentioned after its statement. An
advantage of the ergodic theoretic techniques used
here is that we can recover more general polynomial
patterns and also that the result holds for all finite
fields and not only . A perhaps more
interesting feature, however, is the use of the novel–
in the finitistic setting–“colouring trick”, which,
in a way, allows us to recover this partition
regularity statement from a weaker density theorem.
In a different direction we are also interested in the
question of
section of [3]. Namely, is it true that under the
assumptions of Theorem 1.9 above we get triple
intersections of the form
for some
? A generalization of the next non-commutative
double ergodic theorem, without
the assumption of ergodicity, would answer this question in the
affirmative.
Theorem 1.16.
Let be a countable field and be a double Følner sequence in . Let
be a probability space on which we assume
that acts by measure preserving
transformations and (crucially) we further assume that the action
of the additive subgroup is ergodic. Then,
given any , we have that
|
|
|
Unfortunately, we were unable to recover the result in its full generality.
However, we make a natural conjecture.
Conjecture 1.17.
In the context of Theorem 1.16, if does not act ergodically, then given any , we have that
|
|
|
In a relevant direction, Theorem 1.3
was generalised to all finite fields, initially by
Cilleruelo ([5, Corollary ]) and
subsequently by Hanson ([8, Theorem ]) and
Bergelson and Moreira ([3, Theorem ]).
However, a generalisation of Theorem 1.4 to any finite field remained open and we address
this problem hereby through a “finitisation” of
Theorem 1.16.
Theorem 1.18.
Let be any finite field and let be any
sets satisfying . Then, there exist
such that , and .
The ideas and techniques appearing in the proof of
Theorem 1.16 spring from
classical ergodic theoretic
arguments used in proving multiple ergodic theorems.
In this regard, the proof of Theorem 1.18, which is more or less a “finitisation” of the
above-mentioned proof, is different from
Shkredov’s original combinatorial proof of Theorem 1.4.
Finally, by using the finitistic “colouring trick”
and a finitistic version of Conjecture 1.17, we provide an elementary, conditional
proof of the following generalisation of Green and Sanders’
Theorem 1.5.
Conjecture 1.19.
Let be fixed. Then, there is ,
so that if is any finite field with and , there are
quadruples monochromatic , where does
not depend on .
Acknowledgments. The author expresses gratitude to
his advisor, Joel Moreira, for his guidance and beneficial
discussions during the preparation of this paper.
Thanks also go to Matt Bowen, Nikos Frantzikinakis and
Andreas Mountakis for comments on earlier drafts.
2 Preliminaries and some useful results
2.1 The action of the affine group
For a countable field , we denote by the group of affine transformations of , with the operation of composition. The additive subgroup of is denoted by and consists of the transformations , for . Similarly, the multiplicative subgroup, denoted by , consists of transformations of the form for .
The map can be represented by the
composition and we have the trivial, but very useful throughout this paper, identity:
|
|
|
(2.1) |
The affine group appears naturally in our considerations because in
order, for example,
to find patterns in a subset we can show
that for some , the intersection is non-empty.
We have already mentioned the utility of double Følner sequences as averaging schemes in . The existence of
such sequences was proved in Proposition of [3].
Proposition 2.1.
Any countable field admits a sequence of non-empty finite sets
which forms a Følner sequence for both the
actions of the additive group and the multiplicative group
. In other words, for any , we have that
|
|
|
According to Lemma in [3], some
transformations of double Følner sequences remain
double Følner sequences.
Lemma 2.2.
Let be a countable field. If is a double Følner sequence in and , then is still a double Følner sequence in .
We will further consider a probability space and a measure preserving action of on . In this context, we denote by and let be given by , for and . This is known as the unitary Koopman representation of . Abusing notation we will usually write instead of and instead of . By we denote the orthogonal projection from onto the subspace of vectors which are fixed by the action of the additive subgroup . Also, by we denote the orthogonal projection from onto the subspace of vectors fixed under the action of .
The useful and unintuitive fact that the projections
and commute was established in Lemma of [3].
Lemma 2.3.
For any we have that
|
|
|
By Lemma 2.3 we see that is
invariant under the actions of both and and that is an orthogonal projection. Since the subgroups and generate the whole group , it follows that is the orthogonal projection from onto the subspace of vectors fixed under the action of .
2.2 Ergodic theorems and van der Corput lemmas
The mean ergodic theorem for unitary
representations of countable abelian
groups, which we will extend later for our purposes, has the following form and a proof of
this version can be found for example in [1], Theorem .
Theorem 2.4.
Let be a countable abelian group and be a
Følner sequence in . Let also be a Hilbert space and be a unitary representation of on . Then for any ,
|
|
|
where the limit is in the strong topology of and denotes the orthogonal projection onto the subspace
of vectors fixed under .
We will consider an adaptation of
the van der Corput
lemma for unitary representations of countable abelian groups. A proof–of a stronger version–appears
in Theorem of [2].
Lemma 2.5.
Let be a countable abelian group and be a
bounded sequence of vectors in a Hilbert space , indexed by the
elements of . Let be a Følner sequence in . If
|
|
|
then also
|
|
|
Another version of the van der Corput lemma,
which will be used in Section
6, follows as a
corollary of the inequality given
in Lemma , Chapter of Host and Kra’s book [10].
Proposition 2.6.
Let be a countable abelian group with identity and for each let be a
bounded sequence of vectors in a Hilbert space with norm , indexed by the
elements of . Let be a Følner sequence in . If for all ,
|
|
|
then also
|
|
|
For finite groups, a version of the van
der Corput lemma is given by the following simple
equality. We will use this to adapt our infinite
ergodic theorems to the setting of finite fields.
Proposition 2.7.
Let be a finite group and be a sequence
taking values in a Hilbert space .
Then,
|
|
|
Finally, we shall find the next classical result useful.
Lemma 2.8.
Let be a bounded, non-negative sequence,
indexed by
elements of a countable (amenable) group and let
be a Følner sequence in . Then
|
|
|
Throughout this section we assume that is a countable field,
is a double Følner sequence in and is a non-constant admissible polynomial over , according to Definition 1.10. We also let be a
probability space on which we assume that
acts by measure preserving
transformations. In consistency with the notation from Section
2, , denotes the orthogonal
projection from onto the subspace of functions fixed under the action
of and , are the orthogonal projections on the
subspaces of vectors fixed under the additive action and the
multiplicative action , respectively. Moreover, is the unitary Koopman representation of (for details recall the discussion after Lemma 2.2). Again, for simplicity, we will write instead of and instead of .
Before embarking on the proof of Theorem 1.13
we show the ensuing, straightforward corollary of it.
Corollary 3.1.
If , , and
are as above, then for any ,
we have that
|
|
|
Proof.
For we see that
|
|
|
which can be written as (using that is preserves , for all )
|
|
|
(3.1) |
By Theorem 1.13 applied for , (3.1) becomes
|
|
|
For the last inequality observe that is an orthogonal projection and so
|
|
|
by the Cauchy-Schwarz inequality. Finally, because we have that
|
|
|
and thus we conclude.
∎
For the special case , the proof of Theorem
1.13 was given in [3]. We only mention
that in the proof of the linear case in [3], the authors
relied on a version of the mean ergodic Theorem
2.4 for the action of . For the
polynomial case of Theorem 1.13 we will use
the subsequent generalization, which is a polynomial mean ergodic
theorem for the action of . For that we will need an
application of the van
der Corput trick utilizing the
additive structure of , which facilitates an
induction argument on the polynomial’s degree.
Theorem 3.2.
Let be a countable field and
be admissible. Let also
be a double Følner sequence in and
a probability space, on which acts by measure preserving transformations (see also the beginning of this
section). Then, given any we have that
|
|
|
where the limit is in the strong topology of .
Proof.
We prove the case , some (see
also Remark 3.3). If
, where and , then it follows by the
mean ergodic theorem that
|
|
|
Note that here we used the fact that
is still a Følner sequence for the additive group , in view of Lemma 2.2 and the obvious observation that shifts of Følner sequences are also Følner sequences in any group. Now,
assume the statement holds for polynomials of degree , where and let have degree , i.e., , and . First, we let be such that and set . Then, for any , we have that
|
|
|
Observe that
|
|
|
where . Therefore,
|
|
|
where , and since , the above argument
shows that the polynomial has degree in .
We note that an issue arises
in allowing the polynomial’s degree to be , in which case if, for example, ,
then is a constant, because in
a field of characteristic .
Returning to the proof, by the induction hypothesis and the
assumption on , we see that for any ,
|
|
|
Thus, an application of the van der Corput trick as in Lemma 2.5 gives us that
|
|
|
in , when . Finally, for a general we can write and from the above and linearity it follows that
|
|
|
∎
We will now give the proof of Theorem 1.13,
the statement of which we recall for the reader’s convenience.
Theorem 1.13.
Let , , , and be as in the beginning of this section. Then, given any we have that
|
|
|
where the limit is in the strong topology of .
Proof.
Let and assume that . For we
now set and then, for any we
have that
|
|
|
If , and ( if ), then
|
|
|
which, for fixed, is also a polynomial of degree .
Thus, applying Theorem 3.2 we have that for ,
|
|
|
Once again, the van der Corput lemma implies that for ,
|
|
|
and this allows us to conclude just like in the case of Theorem 3.2, after decomposing a general as .
∎
Using some quantitative bounds for the set of return
times, which can be extracted from the proof of
Corollary 3.1,
and the
variant of Furstenberg’s correspondence
principle established in Theorem of [3], we
can recover the following precise version of Theorem
1.12. The proof is a
straightforward adaptation of the proof of Theorem
from Theorem in [3], which amounts
to the special case that .
Theorem 3.4.
Let be a countable field, an admissible polynomial and
be a double Følner sequence in . Let with Then, for any we have that
|
|
|
In less precise terms, for each element of a large set of there is a large set of satisfying .
To conclude the results of this section we give a precise
statement of Theorem 1.11.
Theorem 3.5.
Let be a countable field, a double Følner sequence in and an admissible polynomial. Then, for any finite colouring
,
there exists a colour such that
|
|
|
The proof of Theorem
3.5 is based on the
“colouring trick” of (and is almost identical to) the
proof of Theorem in [3], and
therefore is omitted. The only difference being that we rely
on Corollary 3.1,
while in [3] the authors relied on its special case of a
linear polynomial.
It seems like our methods are not rigid enough to deal with
non-admissible polynomials according to Definition
1.10 because of the comment in the proof
of Theorem 3.2, so we make
the following natural questions.
Question 3.6.
Does Corollary 3.1 hold
if is not admissible?
Question 3.7.
Does Theorem 3.5 (or a
vague version as in Theorem
1.11) hold for non-admissible
polynomials ?
We note that a positive answer to Question 3.6 would also
imply a positive answer to Question 3.7 by the same
argument that is used for the case of admissible polynomials.
4 A finite fields version of Theorem 1.12
In this section we will adapt the proof of Theorem 1.12 to the finite fields setting and prove Theorem 1.14.
For a finite field we consider its group of affine transformations,
, which consists of the maps of the form
where and . We also let be a probability space on which acts by measure preserving
transformations, with denoting the action. As before, we let , where
and , where . Also, in an abuse of notation, if
is the Koopman
representation of on we write for and for , where for example, for we have that .
Moreover, if is the orthogonal projection onto the space of
functions invariant under the subgroup , we see that
and if is
the projection onto the space of functions invariant
under , then .
We will begin with a finitistic version of the
polynomial mean ergodic theorem of Section 3 and then prove an analogue of
Theorem 1.13. As in the infinite case, and exhibit commuting behavior (see the proof of Theorem in [3]).
Proposition 4.1.
For and , as above, we have that .
Thus, is an orthogonal projection onto the
subspace of functions invariant under .
The promised finitistic analogue of Theorem 3.2 is this.
Proposition 4.2.
Let be a finite field and assume that acts on as in the beginning of this section. Let also be an admissible polynomial of degree . Then, for any we have that
|
|
|
Proof.
If , and , this is obvious, for , whence it is enough to make a change of variables and use the definition of . Assume now that the conclusion holds for polynomials of degree at most and let be a polynomial of degree , where , some . Then,
|
|
|
where , so that Clearly,
|
|
|
On the other hand, by Proposition 2.7 it follows that
|
|
|
(4.1) |
Since , the polynomial has degree for any (this would no longer be true if the degree of was just like the infinite field case), and since , the induction hypothesis implies that
|
|
|
(4.2) |
Finally, we see that
|
|
|
which, by an application of the Cauchy-Schwarz inequality is bounded above by
|
|
|
(4.3) |
Using (4.2) in (4.3) and then by (4.1) it follows that
|
|
|
∎
We isolate the following estimate that appears in the proof of
the finitistic analogue of Corollary 3.1, that is, Theorem 4.4
below. This estimate is the finitistic analogue of
Theorem 1.13 for functions orthogonal to the
space of functions fixed under the action of .
Proposition 4.3.
Let be a finite field and assume that acts on
as in the beginning of this section. Let also
be an admissible polynomial of degree
. Let for some
. Then,
|
|
|
(4.4) |
Proof.
From Proposition 2.7 we have that
|
|
|
(4.5) |
Now, for (in fact for any , but this wouldn’t lead to a practically useful bound) it is easy to see that
|
|
|
(4.6) |
On the other hand, for any , we have
|
|
|
(4.7) |
Moreover,
|
|
|
(4.8) |
But, if , then is a polynomial of same degree as , and so by Proposition 4.2 and because , (4.8) becomes
|
|
|
Using this in (4.7) we get that (for
|
|
|
(4.9) |
Combining (4.6) and (4.9) it follows from (4.5)
that
|
|
|
It is shown in the proof of Theorem in [3] that
. Therefore, the latter inequality
readily implies (4.4) and so we conclude.
∎
Theorem 4.4.
Let be a finite field and assume that acts on as in the beginning of this section. Let also be an admissible polynomial of degree .
Then, for any set , such
that , there exists so that
If, in addition, the action of is ergodic, then
for any sets which satisfy
, there is some with
Proof.
Let . For the second conclusion it suffices to
prove the following averages are positive (for the first conclusion
we prove the same thing with )
|
|
|
|
|
|
|
|
(4.10) |
where . Now, we observe that
|
|
|
(4.11) |
If acts ergodically, then and so (4.11) becomes
|
|
|
(4.12) |
If and we don’t assume ergodicity, then
, where is the projection
onto the space of functions invariant under by
Proposition 4.1. Therefore and it follows by
the Cauchy-Schwarz inequality that
|
|
|
(4.13) |
For the last averages in (4.10) another application of Cauchy-Schwarz’s inequality gives that
|
|
|
(4.14) |
So, from (4.4) in Proposition 4.3 the inequality in (4.14) now becomes
|
|
|
In conclusion, (4.10) implies that
|
|
|
(4.15) |
As we have alluded to in the beginning of this proof, there are now two routs. If acts ergodically, then (4.15) becomes
|
|
|
(4.16) |
and this is positive whenever . If we don’t assume ergodicity and , then we have
|
|
|
(4.17) |
which is positive precisely when
∎
Some quantitative bounds for the set of return times
in the previous theorem–which will be used in the proof of Theorem
1.14 given below and in Section 5–are the following.
Corollary 4.5.
Let be a finite field and assume that acts on by m.p.t. Let also be an admissible polynomial of degree , and . Then, the set of return times satisfies
|
|
|
(4.18) |
If, in addition, the action of is ergodic, then for any and , the set satisfies
|
|
|
(4.19) |
Proof.
By (4.17) we know that
|
|
|
At the same time, implies
that
|
|
|
Combining the two inequalities we see that
|
|
|
and thus
|
|
|
which is (4.18). For the ergodic case we use (4.16) instead of (4.17) and the rest is similar.
∎
We shall conclude this section by proving Theorem 1.14.
Theorem 1.14.
Let be a finite field. Then, if
is an admissible polynomial over of degree and
with , there are , so that and .
Proof.
Consider the action by affine transformations of
on with the normalised counting measure ,
i.e. , for any . Then the action of
is ergodic. Now, for , we let and
By Corollary 4.5 we know that
|
|
|
This means that
|
|
|
(4.20) |
Observe that for we have that
|
|
|
which means that for each there are elements , such that and .
∎
5 A new “colouring trick” and partition regularity for
finite fields
In this section we will adapt the infinite “colouring
trick” presented in Section of [3] in order to
recover a partition regularity result for finite fields,
namely Theorem 1.15,
from weaker density results established in Section
4; essentially from the proof
of Theorem 1.14. We recall Theorem 1.15 for convenience.
Theorem 1.15.
Let be fixed. Then, there is , so
that for a finite field with and
and a polynomial of
, any colouring
contains monochromatic triples of the form .
Proof.
Let , , be fixed and let be any finite field with , for to be determined later. For an -colouring of such a field, we can permute the colours if necessary and assume that . Clearly then, . Next, we pick a number in the following manner. If , we set . Else, we have that and .
Then, we either have that , whence or not and
let . In this fashion we set
|
|
|
Let . We consider the natural measure
preserving action of on (defined coordinate-wise),
with the counting measure given by , for any
. For any , let
|
|
|
the size of which we can bound below by Corollary 4.5, which implies that
|
|
|
(5.1) |
Next, we show that
|
|
|
(5.2) |
Observe that by the definition of it follows that
|
|
|
(5.3) |
Combining (5.1) with (5.3), we see that (5.2) follows from
|
|
|
or equivalently that,
|
|
|
(5.4) |
Using the definition of and it holds that
|
|
|
Now, one can see that
|
|
|
when . If , then the equation becomes .
Finally, (5.4) follows from
|
|
|
(5.5) |
which holds for , with large
enough, since the RHS goes to as , for fixed. By (5.2) we know
that as
|
|
|
Thus, there must exist , such that . Then, if , for , by the definition of and the measure we will also have that
|
|
|
(5.6) |
and hence . This implies the existence of with such that . In particular, for each there are, by (5.6), at least monochromatic triples .
∎
Throughout this short section we will assume that is a countable field and
is a double Følner sequence in .
We also let denote an action of on some probability space
by measure preserving
transformations. For reference, our main goal is to prove the next
result, part of which was initially stated as
Theorem 1.16.
Theorem 6.1.
Let , , and
be as above. Also, we (crucially) further assume that the action
of the additive subgroup is ergodic. Then,
given any , we have that
|
|
|
If, in addition, the action of is ergodic, then for any we have that
|
|
|
The proof is based on the following (double) ergodic theorem.
Theorem 6.2.
Let , , and
be as in the beginning of this section. We further assume that the
action of the additive subgroup is ergodic. Then, for any we have that
|
|
|
where the limit is in .
Proof.
Without loss of generality we assume that and are
real-valued functions. We begin by decomposing as
,
where .
Then,
|
|
|
(6.1) |
As is a constant by the ergodicity of , it follows by (the ergodic) Theorem 2.4 that
|
|
|
Hence, the proof will follow from (6.1) if we can show that
|
|
|
To this end, we let , for . By the van der
Corput trick (see Lemma 2.5) for it suffices to show that
|
|
|
(6.2) |
To this end we note that for ,
|
|
|
|
|
|
|
|
where we have used that preserves . Hence, using the equality (see 2.1), for all , we have
|
|
|
and so it suffices to show that
|
|
|
(6.3) |
By Cauchy-Schwarz’s inequality and Lemma 2.8 the convergence in (6.3) follows from
|
|
|
Now, using Proposition 2.6 with and , for any , , we reduce this to showing that
|
|
|
(6.4) |
for any .
As before we see that
|
|
|
Now, since and for , is a polynomial of degree in , we may use the mean ergodic Theorem 2.4 to obtain that the averages in (6.4) become
|
|
|
(6.5) |
As is ergodic, the projection is a constant and so, using (2.1) and the invariance of under once again, (6.5) becomes
|
|
|
(6.6) |
Because is a double Følner sequence in and it follows by Proposition 2.2 and the mean ergodic theorem that
|
|
|
by the definition of . Therefore, the limit in (6.6) equals zero and so (6.2) follows.
∎
From Theorem 6.2 we can readily recover Theorem 6.1.
Proof of Theorem 6.1.
For we see that
|
|
|
as in the proof of Corollary 3.1.
By Theorem 6.2 for , this limit becomes
|
|
|
(6.7) |
because , is an orthogonal projection and .
For the second part, if in addition acts ergodically, then
and the same method gives the result.
∎
7 Generalization of Shkredov’s theorem
This section is devoted to the proof of Theorem
1.18, which generalizes a
result due to Shkredov pertaining to finite fields of prime
order, as mentioned in Section 1.2. We actually prove the following slightly more general theorem.
Theorem 7.1.
Let be any finite field. Let also be any sets satisfying . Then, there exists such that and .
We have stated Theorem 7.1 for
subsets of because working with an
indicator function of a set
allows us to use
inequalities like , for
all , which simplifies the proof. However, we
do not lose generality as our main result, Theorem
1.18, is an immediate corollary of
Theorem 7.1.
Proof that Theorem 7.1 implies Theorem 1.18.
Let be any sets satisfying
and let , for . Then,
|
|
|
and the right hand side is larger than
|
|
|
where the last inequality holds because
, for any field of order at least . Then the result follows by an application of Theorem 7.1 for the sets .
∎
We now proceed to prove Theorem 7.1.
This proof is an effort to a “finitise” the proof of
Theorem 1.16.
However, there are some additional technicalities here, because
quantities that
vanish in the infinite setting are replaced by
“error” terms which are
bounded (and go to asymptotically as increases to ).
As in the infinite setting, the proof of Theorem
7.1 relies on a finitistic version of the double
ergodic theorem of Theorem 6.2, which is stated in
Proposition 7.3 below. In order to ease
the discussion, we first prove the
following estimate that appears in the proof of the latter.
Proposition 7.2.
Let be any finite field and for some . Then,
|
|
|
Proof.
By Proposition 2.7 we have that for any
|
|
|
Now, as and by (2.1) and preserves , we see that
|
|
|
Observe that we can rewrite this as
|
|
|
(7.1) |
Whenever we have that
|
|
|
|
|
|
|
by definition of |
|
|
|
|
by ergodicity of |
|
|
|
. |
|
|
(7.2) |
Using (2.2) in (7.1) we see that
|
|
|
(7.3) |
Moreover,
|
|
|
(7.4) |
and similarly,
|
|
|
(7.5) |
Now, for each , we have that
|
|
|
and so
|
|
|
Therefore,
|
|
|
(7.6) |
The last inequality follows because the rightmost sum
vanishes for and is non-negative when .
In view of (7.6), the equality in
(7.3) is replaced by
|
|
|
where in the first inequality we also used
(7.4) and (7.5) and the last
inequality holds whenever .
∎
We now prove Proposition 7.3.
Proposition 7.3.
Let be any finite field and let
for some and ,
for some . Then
|
|
|
(7.7) |
Proof.
By Proposition 2.7 and the fact that
preserves for all we see that
|
|
|
As all functions are real-valued, the above can be rewritten
as
|
|
|
Hence, using the Cauchy-Schwarz inequality we see that
|
|
|
(7.8) |
By the triangle inequality, the right hand side in (7.8) is less than or equal to
|
|
|
and then
|
|
|
as and so , by the comments after Theorem 7.1 and the Cauchy-Schwarz inequality. Therefore,
|
|
|
(7.9) |
By an application of Cauchy-Schwarz’s inequality for sums of
products we have that
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
(7.10) |
By Proposition 7.2 we see that
|
|
|
Using this in (7.10) and the bound in
(7.9) we have that
|
|
|
Finally, it follows by the definition of that , as shown in the proof of Theorem in [3]. In conclusion, (7.9) becomes
|
|
|
since whenever .
∎
We are finally in the position to prove the main result of this
section, Theorem 7.1.
Proof of Theorem 7.1.
Using the same notation as in
Section 4, the
assumption of Theorem 7.1 can be
rewritten as and its conclusion is equivalent to the existence of so that
where is the normalised counting measure on . It will thus suffice to show that
Using the fact that preserves for all , this is equivalent to
|
|
|
(7.11) |
We let .
Observe that and
is a constant. Then,
|
|
|
(7.12) |
As it follows by the comments after Theorem 7.1 that
|
|
|
Using this in (7.12), we reduce (7.11) to showing that
|
|
|
Applying the Cauchy-Schwarz inequality the latter follows from showing that
|
|
|
(7.13) |
In Proposition 7.3 we
showed that
|
|
|
and since ,
we see that
(7.13) holds whenever
|
|
|
which is equivalent to our main assumption, namely that .
∎
As a corollary of the proof we get the following
quantitative result.
Corollary 7.4.
Let be any finite field. Let also be any sets satisfying . Then, for each there is a set of cardinality
|
|
|
so that for each there are choices for such that , and .
Proof.
Let for any as above and let
|
|
|
Similarly to the proof of Corollary 4.5, it follows
from the proof of Theorem 7.1 that
|
|
|
(7.14) |
where . By the definition of , (7.14) is equivalent to
|
|
|
(7.15) |
Finally, we see that for each ,
|
|
|
and thus there are choices for
satisfying and .
∎
8 A conditional generalisation of Green and
Sanders’ theorem
In Section 5 we
devised a
finitistic “colouring
trick” to prove Theorem 1.15
from Corollary
4.5. Now, using a similar argument
and a finitistic version of
Conjecture 1.17 as our
basis we will prove a generalisation of
Green and Sanders’ theorem about “monochromatic sums
and products” in finite fields as mentioned in the
introduction.
Before stating the aforementioned conjecture, we make another
related
conjecture that would generalise a special case of Theorem 7.1.
Conjecture 8.1.
Let be any finite field and assume that
acts by m.p.t. on a probability space
. Let be a
set with
, for some constants . Then, there exists such that
|
|
|
For the purpose of proving the generalisation of Green and Sanders’
theorem, that is, Conjecture 1.19, we actually need
only consider a special case of Conjecture
8.1 with and
, some , where is the counting
measure on , and is
a set with , for some
constants .
A way one could try to prove the aforementioned special case of
Conjecture
8.1 would start by decomposing as ,
where . Then, following Section 7 and
considering the inner product
, one would have to show that
|
|
|
(8.1) |
This time is not necessarily a constant, however we still have that
|
|
|
Indeed, as and is an orthogonal projection with we have
|
|
|
where the last inequality is Cauchy-Schwarz. Then, arguing similarly for we have
|
|
|
Therefore, the proof would follow from the following statement,
which is precisely what we are going to use.
Conjecture 8.2.
Let be any finite field and let .
Consider the coordinate-wise affine action of by m.p.t. on , where . Let , where and . Then,
|
|
|
for some .
As a corollary of Conjecture 8.2
we get the following estimates on the set of return times in the
special case of Conjecture 8.1 that we
need. The (conditional) proof is a straightforward adjustment of
the proof of Corollary 7.4 and so we omit it.
Conjecture 8.3.
Let be a finite field and . Assume that acts on by m.p.t. as above. Let and .
Then, the set
|
|
|
satisfies
|
|
|
(8.2) |
We are now in a position to apply a version of the
finitary “colouring trick” and recover Conjecture 1.19, which we recall for convenience.
Conjecture 1.19.
Let be a number of colours. Then, there is , so that
for any finite field with , any colouring
contains monochromatic quadruples
, where is some constant that does not depend on .
Proof.
Let , , be fixed and let be any finite field with , for to be determined later. For an -colouring of such a field we can permute the colours if necessary and assume that . Clearly, then, . Next, we pick a number in the following manner. If , we set . Else, we have that and .
Then, we either have that , whence or not and
let . Proceeding in this fashion we set
|
|
|
Let . We consider the natural measure
preserving action of on (defined coordinate-wise),
with the counting measure given by , for
any
. So, for we have
that
, where is the
normalised counting measure on . For any let
|
|
|
Then, by Corollary 8.3 we have that
|
|
|
which implies that
|
|
|
(8.3) |
We want to bound below the size of ,
because, for any element in this set, it holds that and also that . Then, if , for , by the definition of and the measure we have that and hence , which implies the existence of at least elements
such that . To this end,
by the choice of we have
|
|
|
(8.4) |
Using the definition of and it holds that
|
|
|
(8.5) |
Now,
|
|
|
and so by (8.3), (8.4) and (8.5) we see that
|
|
|
(8.6) |
The quantity at the right hand side of (8.6) can be rewritten as
|
|
|
Now, one can see that
|
|
|
Therefore, the right hand side of (8.6) is greater than or equal to
|
|
|
(8.7) |
which follows by setting
|
|
|
Recall that . We choose large enough to
guarantee that . Since and for any
we have at least
monochromatic quadruples , it follows by (8.7) that there are in total
at least
|
|
|
monochromatic patterns of the form , where is a constant that does
not depend on the size of .
∎