Publications
Publications in reverse chronological order.
2024
- Reliability and Security of AI Hardware. In 2024 IEEE European Test Symposium (ETS), 2024
- Special Session: Reliability Assessment Recipes for DNN Accelerators. In 2024 IEEE 42nd VLSI Test Symposium (VTS), 2024
- Cross-Layer Reliability Evaluation and Efficient Hardening of Large Vision Transformers Models. In Design Automation Conference (DAC), Jun 2024
- Transient Fault Tolerant Semantic Segmentation for Autonomous Driving. In UNCV 2024 - 3rd Workshop on Uncertainty Quantification for Computer Vision, Sep 2024
- Combining Fault Simulation and Beam Data for CNN Error Rate Estimation on RISC-V Commercial Platforms. In 2024 IEEE 30th International Symposium on On-Line Testing and Robust System Design (IOLTS), Sep 2024
- Can GPU performance increase faster than the code error rate? The Journal of Supercomputing, Sep 2024
- Impact of High-Level Synthesis on Reliability of Artificial Neural Network Hardware Accelerators. IEEE Transactions on Nuclear Science, Sep 2024
- Assessing the Impact of Compiler Optimizations on GPUs Reliability. ACM Trans. Archit. Code Optim., Feb 2024
2023
- Understanding the Effects of Permanent Faults in GPU’s Parallelism Management and Control Units. In Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, Feb 2023
- Understanding and Improving GPUs’ Reliability Combining Beam Experiments with Fault Simulation. In 2023 IEEE International Test Conference (ITC), Feb 2023
- Reliability evaluation of Convolutional Neural Network’s basic operations on a RISC-V processor. In NSREC 2023 - IEEE Nuclear & Space Radiation Effects Conference, Jul 2023
- Impact of High-Level-Synthesis on Reliability of Neural Network Hardware Accelerators. In NSREC 2023 - IEEE Nuclear & Space Radiation Effects Conference, Jul 2023
- Understanding the Effects of Permanent Faults in GPU’s Parallelism Management and Control Units. In ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis, Nov 2023
- Neutron-Induced Error Rate of Vision Transformer Models on GPUs. In RADECS - RADiation and its Effects on Components and Systems Conference, Sep 2023
2022
- Performance-Reliability Trade-Off in Graphics Processing Units. In RADiation Effects on Components and Systems (RADECS), Oct 2022
- Transient-Fault-Aware Design and Training to Enhance DNNs Reliability with Zero-Overhead. In 2022 IEEE 28th International Symposium on On-Line Testing and Robust System Design (IOLTS), Oct 2022
- Experimental evaluation of neutron-induced errors on a multicore RISC-V platform. In 2022 IEEE 28th International Symposium on On-Line Testing and Robust System Design (IOLTS), Oct 2022
- Evaluating the Impact of Mixed-Precision on Fault Propagation for Deep Neural Networks on GPUs. In 2022 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), Oct 2022
- A Multi-level Approach to Evaluate the Impact of GPU Permanent Faults on CNN’s Reliability. In 2022 IEEE International Test Conference (ITC), Oct 2022
- Experimental Findings on the Sources of Detected Unrecoverable Errors in GPUs. IEEE Transactions on Nuclear Science, Oct 2022
- An Effective Method to Identify Microarchitectural Vulnerabilities in GPUs. IEEE Transactions on Device and Materials Reliability, Oct 2022
- Characterizing a Neutron-Induced Fault Model for Deep Neural Networks. IEEE Transactions on Nuclear Science, Oct 2022
2021
- Unveiling GPU Vulnerabilities: Comparing and Combining Beam, Fault Simulation, and Profiling. In 35th IEEE International Parallel and Distributed Processing Symposium IPDPS, May 2021
- Revealing GPUs Vulnerabilities by Combining Register-Transfer and Software-Level Fault Injection. In 2021 51st Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), May 2021
- Combining Architectural Simulation and Software Fault Injection for a Fast and Accurate CNNs Reliability Evaluation on GPUs. In 2021 IEEE 39th VLSI Test Symposium (VTS), May 2021
- Experimental Findings on the Sources of Detected Unrecoverable Errors in GPUs. In 2021 IEEE Nuclear & Space Radiation Effects Conference, May 2021
- Protecting GPU’s Microarchitectural Vulnerabilities via Effective Selective Hardening. In 2021 IEEE 27th International Symposium on On-Line Testing and Robust System Design (IOLTS), May 2021
- Reduced Precision DWC: an Efficient Hardening Strategy for Mixed-Precision Architectures. IEEE Transactions on Computers, Feb 2021
2020
- Reduced-Precision DWC for Mixed-Precision GPUs. In 2020 IEEE 26th International Symposium on On-Line Testing and Robust System Design (IOLTS), Feb 2020
- An Overview of the Risk Posed by Thermal Neutrons to the Reliability of Computing Devices. In 2020 50th Annual IEEE-IFIP International Conference on Dependable Systems and Networks-Supplemental Volume (DSN-S), Feb 2020
- Thermal Neutrons: a Possible Threat for Supercomputers and Safety Critical Applications. In 2020 IEEE European Test Symposium (ETS), Feb 2020
- High-Energy vs. Thermal Neutron Contribution to Processor and Memory Error Rates. IEEE Transactions on Nuclear Science, 2020
- Thermal neutrons: a possible threat for supercomputer reliability. The Journal of Supercomputing, May 2020
- Impact of Tensor Cores and Mixed Precision on the Reliability of Matrix Multiplication in GPUs. IEEE Transactions on Nuclear Science, May 2020
2019
- Reliability Evaluation of Mixed-Precision Architectures. In 2019 IEEE International Symposium on High Performance Computer Architecture (HPCA), Feb 2019
- Impact of Reduced Precision in the Reliability of Deep Neural Networks for Object Detection. In 2019 IEEE European Test Symposium (ETS), May 2019
- Detecting Errors in Convolutional Neural Networks Using Inter Frame Spatio-Temporal Correlation. In 2019 IEEE 25th International Symposium on On-Line Testing and Robust System Design (IOLTS), Jul 2019
- Selective Fault Tolerance for Register Files of Graphics Processing Units. IEEE Transactions on Nuclear Science, Jul 2019
2018
- Code-Dependent and Architecture-Dependent Reliability Behaviors. In 2018 48th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN), Jun 2018
- Special session: How approximate computing impacts verification, test and reliability. In 2018 IEEE 36th VLSI Test Symposium (VTS), Apr 2018
- Analyzing and Increasing the Reliability of Convolutional Neural Networks on GPUs. IEEE Transactions on Reliability, 2018
- Kernel and Layer Vulnerability Factor to Evaluate Object Detection Reliability in GPUs. IET Computers & Digital Techniques, Sep 2018
2017
- Kernel vulnerability factor and efficient hardening for histogram of oriented gradients. In 2017 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems (DFT), Oct 2017
- Analyzing the Criticality of Transient Faults-induced SDCS on GPU Applications. In Proceedings of the 8th Workshop on Latest Advances in Scalable Algorithms for Large-Scale Systems, Oct 2017
- Evaluation and Mitigation of Soft-Errors in Neural Network-Based Object Detection in Three GPU Architectures. In 2017 47th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), Jun 2017
- Radiation-Induced Error Criticality in Modern HPC Parallel Accelerators. In 2017 IEEE International Symposium on High Performance Computer Architecture (HPCA), Feb 2017
2016
- Radiation Sensitivity Evaluation of Pedestrian Detection Algorithm. Radiation and its Effects on Components and Systems (RADECS), Sep 2016
- Input Size Effects on the Radiation-Sensitivity of Modern Parallel Processors. In 2016 IEEE Radiation Effects Data Workshop (REDW), Jul 2016
- Evaluation of Histogram of Oriented Gradients Soft Errors Criticality for Automotive Applications. ACM Trans. Archit. Code Optim., Nov 2016
- Performance and energy efficiency analysis of HPC physics simulation applications in a cluster of ARM processors. Concurrency and Computation: Practice and Experience, Nov 2016, cpe.4014
2015
- Análise da Eficiência Energética de uma Aplicação HPC de Geofísica em um Cluster de Baixo Consumo. 16th WSCAD - Simpósio em Sistemas Computacionais de Alto Desempenho, Nov 2015