-
Anticipating User Needs: Insights from Design Fiction on Conversational Agents for Computational Thinking
Authors:
Jacob Penney,
João Felipe Pimentel,
Igor Steinmacher,
Marco A. Gerosa
Abstract:
Computational thinking, and by extension, computer programming, is notoriously challenging to learn. Conversational agents and generative artificial intelligence (genAI) have the potential to facilitate this learning process by offering personalized guidance, interactive learning experiences, and code generation. However, current genAI-based chatbots focus on professional developers and may not ad…
▽ More
Computational thinking, and by extension, computer programming, is notoriously challenging to learn. Conversational agents and generative artificial intelligence (genAI) have the potential to facilitate this learning process by offering personalized guidance, interactive learning experiences, and code generation. However, current genAI-based chatbots focus on professional developers and may not adequately consider educational needs. Involving educators in conceiving educational tools is critical for ensuring usefulness and usability. We enlisted nine instructors to engage in design fiction sessions in which we elicited abilities such a conversational agent supported by genAI should display. Participants envisioned a conversational agent that guides students stepwise through exercises, tuning its method of guidance with an awareness of the educational background, skills and deficits, and learning preferences. The insights obtained in this paper can guide future implementations of tutoring conversational agents oriented toward teaching computational thinking and computer programming.
△ Less
Submitted 13 June, 2024; v1 submitted 12 November, 2023;
originally announced November 2023.
-
Tag that issue: Applying API-domain labels in issue tracking systems
Authors:
Fabio Santos,
Joseph Vargovich,
Bianca Trinkenreich,
Italo Santos,
Jacob Penney,
Ricardo Britto,
João Felipe Pimentel,
Igor Wiese,
Igor Steinmacher,
Anita Sarma,
Marco A. Gerosa
Abstract:
Labeling issues with the skills required to complete them can help contributors to choose tasks in Open Source Software projects. However, manually labeling issues is time-consuming and error-prone, and current automated approaches are mostly limited to classifying issues as bugs/non-bugs. We investigate the feasibility and relevance of automatically labeling issues with what we call "API-domains,…
▽ More
Labeling issues with the skills required to complete them can help contributors to choose tasks in Open Source Software projects. However, manually labeling issues is time-consuming and error-prone, and current automated approaches are mostly limited to classifying issues as bugs/non-bugs. We investigate the feasibility and relevance of automatically labeling issues with what we call "API-domains," which are high-level categories of APIs. Therefore, we posit that the APIs used in the source code affected by an issue can be a proxy for the type of skills (e.g., DB, security, UI) needed to work on the issue. We ran a user study (n=74) to assess API-domain labels' relevancy to potential contributors, leveraged the issues' descriptions and the project history to build prediction models, and validated the predictions with contributors (n=20) of the projects. Our results show that (i) newcomers to the project consider API-domain labels useful in choosing tasks, (ii) labels can be predicted with a precision of 84% and a recall of 78.6% on average, (iii) the results of the predictions reached up to 71.3% in precision and 52.5% in recall when training with a project and testing in another (transfer learning), and (iv) project contributors consider most of the predictions helpful in identifying needed skills. These findings suggest our approach can be applied in practice to automatically label issues, assisting developers in finding tasks that better match their skills.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
GiveMeLabeledIssues: An Open Source Issue Recommendation System
Authors:
Joseph Vargovich,
Fabio Santos,
Jacob Penney,
Marco A. Gerosa,
Igor Steinmacher
Abstract:
Developers often struggle to navigate an Open Source Software (OSS) project's issue-tracking system and find a suitable task. Proper issue labeling can aid task selection, but current tools are limited to classifying the issues according to their type (e.g., bug, question, good first issue, feature, etc.). In contrast, this paper presents a tool (GiveMeLabeledIssues) that mines project repositorie…
▽ More
Developers often struggle to navigate an Open Source Software (OSS) project's issue-tracking system and find a suitable task. Proper issue labeling can aid task selection, but current tools are limited to classifying the issues according to their type (e.g., bug, question, good first issue, feature, etc.). In contrast, this paper presents a tool (GiveMeLabeledIssues) that mines project repositories and labels issues based on the skills required to solve them. We leverage the domain of the APIs involved in the solution (e.g., User Interface (UI), Test, Databases (DB), etc.) as a proxy for the required skills. GiveMeLabeledIssues facilitates matching developers' skills to tasks, reducing the burden on project maintainers. The tool obtained a precision of 83.9% when predicting the API domains involved in the issues. The replication package contains instructions on executing the tool and including new projects. A demo video is available at https://www.youtube.com/watch?v=ic2quUue7i8
△ Less
Submitted 23 March, 2023;
originally announced March 2023.
-
Ethical Testing in the Real World: Evaluating Physical Testing of Adversarial Machine Learning
Authors:
Kendra Albert,
Maggie Delano,
Jonathon Penney,
Afsaneh Rigot,
Ram Shankar Siva Kumar
Abstract:
This paper critically assesses the adequacy and representativeness of physical domain testing for various adversarial machine learning (ML) attacks against computer vision systems involving human subjects. Many papers that deploy such attacks characterize themselves as "real world." Despite this framing, however, we found the physical or real-world testing conducted was minimal, provided few detai…
▽ More
This paper critically assesses the adequacy and representativeness of physical domain testing for various adversarial machine learning (ML) attacks against computer vision systems involving human subjects. Many papers that deploy such attacks characterize themselves as "real world." Despite this framing, however, we found the physical or real-world testing conducted was minimal, provided few details about testing subjects and was often conducted as an afterthought or demonstration. Adversarial ML research without representative trials or testing is an ethical, scientific, and health/safety issue that can cause real harms. We introduce the problem and our methodology, and then critique the physical domain testing methodologies employed by papers in the field. We then explore various barriers to more inclusive physical testing in adversarial ML and offer recommendations to improve such testing notwithstanding these challenges.
△ Less
Submitted 3 December, 2020;
originally announced December 2020.
-
Legal Risks of Adversarial Machine Learning Research
Authors:
Ram Shankar Siva Kumar,
Jonathon Penney,
Bruce Schneier,
Kendra Albert
Abstract:
Adversarial Machine Learning is booming with ML researchers increasingly targeting commercial ML systems such as those used in Facebook, Tesla, Microsoft, IBM, Google to demonstrate vulnerabilities. In this paper, we ask, "What are the potential legal risks to adversarial ML researchers when they attack ML systems?" Studying or testing the security of any operational system potentially runs afoul…
▽ More
Adversarial Machine Learning is booming with ML researchers increasingly targeting commercial ML systems such as those used in Facebook, Tesla, Microsoft, IBM, Google to demonstrate vulnerabilities. In this paper, we ask, "What are the potential legal risks to adversarial ML researchers when they attack ML systems?" Studying or testing the security of any operational system potentially runs afoul the Computer Fraud and Abuse Act (CFAA), the primary United States federal statute that creates liability for hacking. We claim that Adversarial ML research is likely no different. Our analysis show that because there is a split in how CFAA is interpreted, aspects of adversarial ML attacks, such as model inversion, membership inference, model stealing, reprogramming the ML system and poisoning attacks, may be sanctioned in some jurisdictions and not penalized in others. We conclude with an analysis predicting how the US Supreme Court may resolve some present inconsistencies in the CFAA's application in Van Buren v. United States, an appeal expected to be decided in 2021. We argue that the court is likely to adopt a narrow construction of the CFAA, and that this will actually lead to better adversarial ML security outcomes in the long term.
△ Less
Submitted 29 June, 2020;
originally announced June 2020.
-
Cold Molecular Gas and Free-Free Emission from Hot, Dust-Obscured Galaxies at $z$~3
Authors:
J. I. Penney,
A. W. Blain,
R. J. Assef,
T. Diaz-Santos,
J. J. González-López,
C. -W. Tsai,
M. Aravena,
P. R. M. Eisenhardt,
S. F. Jones,
H. D. Jun,
M. Kim,
D. Stern,
J. Wu
Abstract:
We report on observations of redshifted CO(1-0) line emission and observed-frame $\rm\sim$ 30GHz radio continuum emission from five ultra-luminous, mid-IR selected hot, Dust-Obscured Galaxies (Hot DOGs) at $z\rm\gtrsim$ 3 using the Karl G. Jansky Very Large Array. We detect CO(1-0) line emission in all five Hot DOGs, with one of them at high signal to noise. We analyse FIR-radio spectral energy di…
▽ More
We report on observations of redshifted CO(1-0) line emission and observed-frame $\rm\sim$ 30GHz radio continuum emission from five ultra-luminous, mid-IR selected hot, Dust-Obscured Galaxies (Hot DOGs) at $z\rm\gtrsim$ 3 using the Karl G. Jansky Very Large Array. We detect CO(1-0) line emission in all five Hot DOGs, with one of them at high signal to noise. We analyse FIR-radio spectral energy distributions, including dust, free-free and synchrotron emission for the galaxies. We find that most of the 115 GHz rest-frame continuum is mostly due to synchrotron or free-free emission, with only a potentially small contribution from thermal emission. We see a deficit in the rest-frame 115 GHz continuum emission compared to dusty star-forming galaxies (DSFGs) and sub-millimetre galaxies (SMGs) at high redshift, suggesting that Hot DOGs do not have similar cold gas reserves compared with star-forming galaxies. One target, W2305-0039, is detected in the FIRST 1.4 GHz survey, and is likely to possess compact radio jets. We compare to the FIR-radio correlation, and find that at least half of the Hot DOGs in our sample are radio-quiet with respect to normal galaxies. These findings suggest that Hot DOGs have comparably less cold molecular gas than star-forming galaxies at lower, $z\rm\sim$ 2 redshifts, and are dominated by powerful, yet radio-quiet AGN.
△ Less
Submitted 15 June, 2020;
originally announced June 2020.
-
Politics of Adversarial Machine Learning
Authors:
Kendra Albert,
Jonathon Penney,
Bruce Schneier,
Ram Shankar Siva Kumar
Abstract:
In addition to their security properties, adversarial machine-learning attacks and defenses have political dimensions. They enable or foreclose certain options for both the subjects of the machine learning systems and for those who deploy them, creating risks for civil liberties and human rights. In this paper, we draw on insights from science and technology studies, anthropology, and human rights…
▽ More
In addition to their security properties, adversarial machine-learning attacks and defenses have political dimensions. They enable or foreclose certain options for both the subjects of the machine learning systems and for those who deploy them, creating risks for civil liberties and human rights. In this paper, we draw on insights from science and technology studies, anthropology, and human rights literature, to inform how defenses against adversarial attacks can be used to suppress dissent and limit attempts to investigate machine learning systems. To make this concrete, we use real-world examples of how attacks such as perturbation, model inversion, or membership inference can be used for socially desirable ends. Although the predictions of this analysis may seem dire, there is hope. Efforts to address human rights concerns in the commercial spyware industry provide guidance for similar measures to ensure ML systems serve democratic, not authoritarian ends
△ Less
Submitted 26 April, 2020; v1 submitted 31 January, 2020;
originally announced February 2020.
-
The Environments of Luminous Radio - WISE Selected Infrared Galaxies
Authors:
J. I. Penney,
A. W. Blain,
D. Wylezalek,
N. A. Hatch,
C. Lonsdale,
A. Kimball,
R. J. Assef,
J. J. Condon,
P. R. M. Eisenhardt,
S. F. Jones,
M. Kim,
M. Lacy,
S. I. Muldrew,
S. Petty,
A. Sa**a,
A. Silva,
D. Stern,
T. Diaz-Santos,
C-W. Tsai,
J. Wu
Abstract:
We have observed the environments of a population of 33 heavily dust obscured, ultra-luminous, high-redshift galaxies, selected using WISE and NVSS at $z>$1.3 with the Infra-Red Array Camera on the $Spitzer$ Space Telescope over $\rm5.12\,'\times5.12\,'$ fields. Colour selections are used to quantify any potential overdensities of companion galaxies in these fields. We find no significant excess o…
▽ More
We have observed the environments of a population of 33 heavily dust obscured, ultra-luminous, high-redshift galaxies, selected using WISE and NVSS at $z>$1.3 with the Infra-Red Array Camera on the $Spitzer$ Space Telescope over $\rm5.12\,'\times5.12\,'$ fields. Colour selections are used to quantify any potential overdensities of companion galaxies in these fields. We find no significant excess of galaxies with the standard colour selection for IRAC colours of $\rm[3.6]-[4.5]>-0.1$ consistent with galaxies at $z>$1.3 across the whole fields with respect to wide-area $Spitzer$ comparison fields, but there is a $\rm>2σ$ statistical excess within $\rm0.25\,'$ of the central radio-WISE galaxy. Using a colour selection of $\rm[3.6]-[4.5]>0.4$, 0.5 magnitudes redder than the standard method of selecting galaxies at $z>$1.3, we find a significant overdensity, in which $\rm76\%$ ($\rm33\%$) of the 33 fields have a surface density greater than the $\rm3σ$ ($\rm5σ$) level. There is a statistical excess of these redder galaxies within $\rm0.5\,'$, rising to a central peak $\rm\sim2$--4 times the average density. This implies that these galaxies are statistically linked to the radio-WISE selected galaxy, indicating similar structures to those traced by red galaxies around radio-loud AGN.
△ Less
Submitted 16 November, 2018;
originally announced November 2018.
-
Coherent rotations of a single spin-based qubit in a single quantum dot at fixed Zeeman energy
Authors:
Jordan Kyriakidis,
Stephen J. Penney
Abstract:
Coherent rotations of single spin-based qubits may be accomplished electrically at fixed Zeeman energy with a qubit defined solely within a single electrostatically-defined quantum dot; the $g$-factor and the external magnetic field are kept constant. All that is required to be varied are the voltages on metallic gates which effectively change the shape of the elliptic quantum dot. The pseudospi…
▽ More
Coherent rotations of single spin-based qubits may be accomplished electrically at fixed Zeeman energy with a qubit defined solely within a single electrostatically-defined quantum dot; the $g$-factor and the external magnetic field are kept constant. All that is required to be varied are the voltages on metallic gates which effectively change the shape of the elliptic quantum dot. The pseudospin-1/2 qubit is constructed from the two-dimensional $S=1/2$, $S_z=-1/2$ subspace of three interacting electrons in a two-dimensional potential well. Rotations are created by altering the direction of the pseudomagnetic field through changes in the shape of the confinement potential. By deriving an exact analytic solution to the long-range Coulomb interaction matrix elements, we calculate explicitly the range of magnitudes and directions the pseudomagnetic field can take. Numerical estimates are given for {GaAs}.
△ Less
Submitted 6 March, 2005; v1 submitted 6 July, 2004;
originally announced July 2004.