Skip to main content

Showing 1–5 of 5 results for author: Stoica, G

Searching in archive cs. Search in all archives.
.
  1. Analyzing domain shift when using additional data for the MICCAI KiTS23 Challenge

    Authors: George Stoica, Mihaela Breaban, Vlad Barbu

    Abstract: Using additional training data is known to improve the results, especially for medical image 3D segmentation where there is a lack of training material and the model needs to generalize well from few available data. However, the new data could have been acquired using other instruments and preprocessed such its distribution is significantly different from the original training data. Therefore, we… ▽ More

    Submitted 11 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution is published in https://link.springer.com/book/10.1007/978-3-031-54806-2, and is available online at https://doi.org/10.1007/978-3-031-54806-2_4

    Journal ref: Kidney and Kidney Tumor Segmentation. KiTS 2023. Lecture Notes in Computer Science, vol 14540

  2. arXiv:2305.03053  [pdf, other

    cs.CV cs.LG

    ZipIt! Merging Models from Different Tasks without Training

    Authors: George Stoica, Daniel Bolya, Jakob Bjorner, Pratik Ramesh, Taylor Hearn, Judy Hoffman

    Abstract: Typical deep visual recognition models are capable of performing the one task they were trained on. In this paper, we tackle the extremely difficult problem of combining distinct models with different initializations, each solving a separate task, into one multi-task model without any additional training. Prior work in model merging permutes one model to the space of the other then averages them t… ▽ More

    Submitted 12 March, 2024; v1 submitted 4 May, 2023; originally announced May 2023.

  3. arXiv:2208.00815  [pdf, ps, other

    cs.LG cs.AI cs.CV

    Dynamic Batch Adaptation

    Authors: Cristian Simionescu, George Stoica, Robert Herscovici

    Abstract: Current deep learning adaptive optimizer methods adjust the step magnitude of parameter updates by altering the effective learning rate used by each parameter. Motivated by the known inverse relation between batch size and learning rate on update step magnitudes, we introduce a novel training procedure that dynamically decides the dimension and the composition of the current update step. Our proce… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

  4. arXiv:2104.08398  [pdf, ps, other

    cs.CL

    Re-TACRED: Addressing Shortcomings of the TACRED Dataset

    Authors: George Stoica, Emmanouil Antonios Platanios, Barnabás Póczos

    Abstract: TACRED is one of the largest and most widely used sentence-level relation extraction datasets. Proposed models that are evaluated using this dataset consistently set new state-of-the-art performance. However, they still exhibit large error rates despite leveraging external knowledge and unsupervised pretraining on large text corpora. A recent study suggested that this may be due to poor dataset qu… ▽ More

    Submitted 16 April, 2021; originally announced April 2021.

  5. arXiv:2012.04812  [pdf, other

    cs.CL

    Improving Relation Extraction by Leveraging Knowledge Graph Link Prediction

    Authors: George Stoica, Emmanouil Antonios Platanios, Barnabás Póczos

    Abstract: Relation extraction (RE) aims to predict a relation between a subject and an object in a sentence, while knowledge graph link prediction (KGLP) aims to predict a set of objects, O, given a subject and a relation from a knowledge graph. These two problems are closely related as their respective objectives are intertwined: given a sentence containing a subject and an object o, a RE model predicts a… ▽ More

    Submitted 8 December, 2020; originally announced December 2020.