-
Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination
Authors:
Eve Fleisig,
Genevieve Smith,
Madeline Bossi,
Ishita Rustagi,
Xavier Yin,
Dan Klein
Abstract:
We present a large-scale study of linguistic bias exhibited by ChatGPT covering ten dialects of English (Standard American English, Standard British English, and eight widely spoken non-"standard" varieties from around the world). We prompted GPT-3.5 Turbo and GPT-4 with text by native speakers of each variety and analyzed the responses via detailed linguistic feature annotation and native speaker evaluation. We find that the models default to "standard" varieties of English; based on evaluation by native speakers, we also find that model responses to non-"standard" varieties consistently exhibit a range of issues: lack of comprehension (10% worse compared to "standard" varieties), stereotyping (16% worse), demeaning content (22% worse), and condescending responses (12% worse). We also find that if these models are asked to imitate the writing style of prompts in non-"standard" varieties, they produce text that exhibits lower comprehension of the input and is especially prone to stereotyping. GPT-4 improves on GPT-3.5 in terms of comprehension, warmth, and friendliness, but it also results in a marked increase in stereotyping (+17%). The results suggest that GPT-3.5 Turbo and GPT-4 exhibit linguistic discrimination in ways that can exacerbate harms for speakers of non-"standard" varieties.
Submitted 13 June, 2024;
originally announced June 2024.
-
Standard Language Ideology in AI-Generated Language
Authors:
Genevieve Smith,
Eve Fleisig,
Madeline Bossi,
Ishita Rustagi,
Xavier Yin
Abstract:
In this position paper, we explore standard language ideology in language generated by large language models (LLMs). First, we outline how standard language ideology is reflected and reinforced in LLMs. We then present a taxonomy of open problems regarding standard language ideology in AI-generated language with implications for minoritized language communities. We introduce the concept of standard AI-generated language ideology, the process by which AI-generated language regards Standard American English (SAE) as a linguistic default and reinforces a linguistic bias that SAE is the most "appropriate" language. Finally, we discuss tensions that remain, including reflecting on what desirable system behavior looks like, as well as advantages and drawbacks of generative AI tools imitating--or often not--different English language varieties. Throughout, we discuss standard language ideology as a manifestation of existing global power structures in and through AI-generated language before ending with questions to move towards alternative, more emancipatory digital futures.
Submitted 12 June, 2024;
originally announced June 2024.
-
GAUDI: a preparatory archive for the COROT mission
Authors:
E. Solano,
C. Catala,
R. Garrido,
E. Poretti,
E. Janot-Pacheco,
R. Gutierrez,
R. Gonzalez,
L. Mantegazza,
C. Neiner,
Y. Fremat,
S. Charpinet,
W. Weiss,
P. J. Amado,
M. Rainer,
V. Tsymbal,
D. Lyashko,
D. Ballereau,
J. C. Bouret,
T. Hua,
D. Katz,
F. Lignieres,
T. Luftinger,
P. Mittermayer,
N. Nesvacil,
C. Soubiran
, et al. (12 additional authors not shown)
Abstract:
The GAUDI database (Ground-based Asteroseismology Uniform Database Interface, http://sdc.laeff.esa.es/gaudi/) is a preparatory archive for the COROT (COnvection, ROtation and planetary Transits, http://www.astrsp-mrs.fr/projets/corot/) mission developed at LAEFF (Laboratory for Space Astrophysics and Theoretical Physics, http://www.laeff.esa.es). Its intention is to make the ground-based observations obtained in the preparation of the asteroseismology programme available in a simple and efficient way. It contains spectroscopic and photometric data together with inferred physical parameters for more than 1500 objects gathered since January 1998 in 6 years of observational campaigns. In this paper, the main functionalities and characteristics of the system are described. The observations have been collected at ESO-La Silla, Telescopio Nazionale Galileo, Observatoire de Haute-Provence, South African Astronomical Observatory, Tautenberg Observatory and Sierra Nevada Observatory.
Submitted 30 September, 2004;
originally announced September 2004.
-
The multiperiodicity of the $γ$ Doradus stars HD 224945 and HD 224638 as detected from a multisite campaign
Authors:
E. Poretti,
C. Koen,
M. Bossi,
E. Rodriguez,
S. Martin,
K. Krisciunas,
M. C. Akan,
R. Crowe,
M. Wilcox,
C. Ibanoglu,
S. Evren
Abstract:
We discuss new photometric data collected on the gamma Dor variables HD 224945 and HD 224638. Multiperiodicity was detected in both stars, thanks to the clear spectral window of a multisite campaign that involved five observatories. HD 224945 shows the shortest period among the gamma Dor stars, i.e., 0.3330 d. The pulsation behaviour is very different: HD 224945 displays a set of frequencies spread over an interval much wider than that of HD 224638. We clearly found evidence for amplitude variations in the excited modes by comparing data from different years. HD 224945 and HD 224638 are among the best examples of $γ$ Dor stars that show multimode pulsations, which make them very interesting from an asteroseismological point of view.
Submitted 14 January, 2002;
originally announced January 2002.
-
Variable stars in nearby galaxies. II. Population I and II Cepheids in Field A of IC 1613
Authors:
E. Antonello,
L. Mantegazza,
D. Fugazza,
M. Bossi
Abstract:
The light curves of Cepheids and other variable stars in Field A of IC 1613, obtained with a CCD and no filter ($Wh$ photometry), have been analyzed. It is possible to separate first overtone from fundamental mode population I Cepheids taking into account the pulsation amplitude, the shape of the light curve and the period. The expected separation is verified in the period--luminosity $PL$ diagram. Light curve Fourier parameters have been compared with those of Magellanic Clouds and galactic Cepheids, in order to point out the effects of the very low metallicity of IC 1613 on the light curve shape. Population II Cepheids of IC 1613 can be discriminated from those of population I in the $PL$ diagram, and, taking into account their color, from other red or blue variables. Their $PL$ relation is consistent with that observed in globular clusters, nearby dwarf spheroidal galaxies and LMC. We have shown it is possible to apply the single-phase method for deriving standard photometry $PL$ relations for population I and II Cepheids; therefore with just one accurate $BVRI$ observation it is possible to use the population I Cepheids for distance determinations. Some unusual stars have been identified on the basis of periods, light curve shapes and colors; they appear to be pulsating stars lying on the extension of $PL$ relation of known anomalous Cepheids. A firmer classification of these and other faint stars requires further deeper multicolor observations.
Submitted 9 September, 1999;
originally announced September 1999.
-
Variable stars in nearby galaxies. I. Search for Cepheids in Field A of IC 1613
Authors:
E. Antonello,
L. Mantegazza,
D. Fugazza,
M. Bossi,
S. Covino
Abstract:
The first results are presented of a four-year program dedicated to the CCD observations of Cepheids in the nearby galaxy IC 1613. Since the program was carried out with a relatively small telescope, the Dutch 0.9 m at ESO-La Silla, the observations were performed without filter (white light), or Wh-band; the advantage of this technique is that the photon statistics correspond to that of V-band observations made with larger telescopes than 2 m and similar exposure time. The effective wavelength of the Wh-band is intermediate between that of V and R bands for stars of A-G spectral type, for back-illuminated CCD detectors. The analysis of the observations of Field A revealed the presence of about 110 variable stars. The detected population I Cepheids are 43; 9 Cepheids were already known from previous works, while most of the new stars have a short period P. For stars with P > 5 d and sufficient phase coverage it is possible to perform good Fourier decomposition of light curves with resulting standard deviation of the fit of 0.02 - 0.04 mag. There are several Cepheids with relatively small amplitude, and most of them are first overtone mode pulsators; the faintest detected Cepheids have V about 23 and P about 1 day. At least 5 population II Cepheids and 8 eclipsing binaries have been observed. The other variable stars are probable long period, semiregular and irregular variables. A comparison with results of other massive CCD photometric projects dedicated to the detection of variable stars shows some advantages of the observations in white light for fully exploiting the capabilities of relatively small telescopes. A suggestion is made on how to use these results for distance determinations.
Submitted 29 June, 1999;
originally announced June 1999.
-
Simultaneous intensive photometry and high resolution spectroscopy of delta Scuti stars. III. Mode identifications and physical calibrations in HD 2724
Authors:
M. Bossi,
L. Mantegazza,
N. S. Nunez
Abstract:
On the basis of our new simultaneous photometry and spectroscopy (885 uvby differential measurements in 11 nights and 154 spectrograms of the FeII 4508 A region in 5 nights), we can detect 12 probable periodicities in the variability pattern of this star, determining the frequencies of 7 without any ambiguity. Through a direct fit of pulsational models to our data, we estimate the inclination of rotational axis to be about 50 deg. and get a reliable identification of 4 modes as well as useful bits of information about the others: no retrograde mode is visible, whereas the star seems to show a certain preference for purely sectorial prograde oscillations. Finally, the attribution of our lowest frequency to the radial fundamental pulsation allows a new calibration of physical parameters. In particular, the gravity can be determined with unusual accuracy and the luminosity evaluation becomes more consistent with the Hipparcos astrometry.
Submitted 27 May, 1998;
originally announced May 1998.
-
The Gamma Doradus variables, a new class of pulsating stars. The case of HD 224945
Authors:
E. Poretti,
C. Akan,
M. Bossi,
C. Koen,
K. Krisciunas,
E. Rodriguez
Abstract:
A multisite campaign involving five observatories allowed us to solve the light curve of the Gamma Doradus star HD 224945. The multiperiodic content, the high frequency term at 3.00 c/d and the frequency spacing between the observed terms strongly support its pulsational nature. Considering the low frequency terms observed in Gamma Dor stars, it seems that g--mode pulsators can be found near the cold border of the instability strip.
Submitted 6 December, 1996;
originally announced December 1996.