
No authors of 2309.09400 can endorse.

CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages

Thien Nguyen: Registered as an author of this paper.
Not currently an endorser.

Chien Van Nguyen, Viet Dac Lai, Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Ryan A. Rossi and Thien Huu Nguyen are not registered as owners of this paper.