Semi-automatic staging area for high-quality structured data extraction from scientific literature
Authors:
Luca Foppiano,
Tomoya Mato,
Kensei Terashima,
Pedro Ortiz Suarez,
Taku Tou,
Chikako Sakai,
Wei-Sheng Wang,
Toshiyuki Amagasa,
Yoshihiko Takano,
Masashi Ishii
Abstract:
We propose a semi-automatic staging area for efficiently building an accurate database of experimental physical properties of superconductors from the literature, called SuperCon2, to enrich the existing manually built superconductor database SuperCon. Here we report our curation interface (SuperCon2 Interface) and a workflow that manages the state transitions of each examined record, in order to validate the dataset of superconductors extracted from PDF documents using Grobid-superconductors in a previous work. This curation workflow allows both automatic and manual operations. The automatic operations comprise "anomaly detection", which scans new data to identify outliers, and a "training data collector" mechanism, which collects training examples based on manual corrections. This training data collection policy is effective in improving the machine-learning models with a reduced number of examples. For manual operations, the SuperCon2 Interface is designed to increase the efficiency of manual correction by providing a smart interface and an enhanced PDF document viewer. We show that our interface significantly improves curation quality, boosting precision and recall compared with traditional "manual correction". Our semi-automatic approach offers a way to achieve a reliable database through text-data mining of scientific documents.
Submitted 16 November, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
Best Thermoelectric Efficiency of Ever-Explored Materials
Authors:
Byungki Ryu,
Jaywan Chung,
Masaya Kumagai,
Tomoya Mato,
Yuki Ando,
Sakiko Gunji,
Atsumi Tanaka,
Dewi Yana,
Masayuki Fujimoto,
Yoji Imai,
Yukari Katsura,
SuDong Park
Abstract:
A thermoelectric device is a heat engine that directly converts heat into electricity. Many materials with a high figure of merit ZT have been discovered in anticipation of a high thermoelectric efficiency. However, there have been few investigations of efficiency-based material evaluation, and little is known about the achievable limit of thermoelectric efficiency. Here, we report the highest thermoelectric efficiencies attainable with 12,645 published materials. A total of 97,841,810 thermoelectric efficiencies are calculated for 808,610 device configurations under various heat-source temperatures (T_h) with the cold-side temperature fixed at 300 K, by solving one-dimensional thermoelectric integral equations with temperature-dependent thermoelectric properties. For infinite-cascade devices, a thermoelectric efficiency larger than 33% (~1/3) is achievable when T_h exceeds 1400 K. For single-stage devices, the best efficiency of 17.1% (~1/6) is possible when T_h is 860 K. Leg segmentation can overcome this limit, delivering a very high efficiency of 24% (~1/4) when T_h is 1100 K.
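For context on the efficiencies quoted in this abstract, the sketch below compares them with the Carnot limit for the same hot-side temperatures and a 300 K cold side. It also evaluates the textbook constant-property estimate of maximum efficiency from a single average ZT. This is not the method of the paper, which solves integral equations with temperature-dependent properties; the function names and the example value `zt_mean = 1.0` are illustrative assumptions.

```python
import math


def carnot_efficiency(t_hot: float, t_cold: float) -> float:
    """Carnot limit for a heat engine between t_hot and t_cold (kelvin)."""
    if not 0.0 < t_cold < t_hot:
        raise ValueError("require 0 < t_cold < t_hot")
    return 1.0 - t_cold / t_hot


def constant_zt_efficiency(t_hot: float, t_cold: float, zt_mean: float) -> float:
    """Textbook constant-property maximum efficiency for an average ZT.

    Only a rough estimate: it ignores the temperature dependence of the
    material properties that the abstract says the paper accounts for.
    """
    if zt_mean < 0.0:
        raise ValueError("zt_mean must be non-negative")
    root = math.sqrt(1.0 + zt_mean)
    return carnot_efficiency(t_hot, t_cold) * (root - 1.0) / (root + t_cold / t_hot)


if __name__ == "__main__":
    t_cold = 300.0  # cold-side temperature stated in the abstract
    zt_mean = 1.0   # hypothetical average ZT, chosen only for illustration
    for t_hot in (860.0, 1100.0, 1400.0):  # hot-side temperatures from the abstract
        carnot = carnot_efficiency(t_hot, t_cold)
        estimate = constant_zt_efficiency(t_hot, t_cold, zt_mean)
        print(f"T_h = {t_hot:.0f} K: Carnot = {carnot:.1%}, "
              f"constant-ZT estimate = {estimate:.1%}")
```

The Carnot value is an upper bound on any of the reported device efficiencies at the same temperatures, so the comparison shows how far below the thermodynamic limit the reported best cases sit.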
Submitted 14 March, 2023; v1 submitted 17 October, 2022;
originally announced October 2022.