Item request has been placed! ×
Item request cannot be made. ×
loading  Processing Request

Efficient neural network processing via model compression and low-power functional units ; Model sıkıştırma ve düşük güç fonksiyonel ünitelerle verimli sinir ağı işleme

Item request has been placed! ×
Item request cannot be made. ×
loading   Processing Request
  • Additional Information
    • Publication Date:
      2024
    • Collection:
      Bilkent University: Institutional Repository
    • Abstract:
      Cataloged from PDF version of article. ; Includes bibliographical references (leaves 63-69). ; We present a framework that contributes neural network optimization through novel methods in pruning, quantization, and arithmetic unit design for resource-constrained devices to datacenters. The first component is a pruning method that employs an importance metric to measure and selectively eliminate less critical neurons and weights, achieving high compression rates up to 99.9% without sacrificing significant accuracy. This idea is improved by a novel pruning schedule that optimizes the balance between compression and model’s generalization capa-bility. Next, we introduce a quantization method that combines with pruning to improve hardware compatibility for floating point format, offering efficient model compression and fast computation and general usability. Finally, we propose a logarithmic arithmetic unit that designed as an energy-efficient alternative to conventional floating-point operations, providing precise and configurable processing without relying on bulky lookup tables. Extensive evaluations across different datasets and CUDA-based simulations and Verilog based hardware designs indicate that our approaches outperforms existing methods, making it a powerful solution for deploying artificial intelligence models more efficiently. ; by Ali Necat Karakuloğlu
    • File Description:
      xiv, 89 leaves : illustrations, charts; 30 cm.; application/pdf
    • Relation:
      https://hdl.handle.net/11693/115942; B149015
    • Online Access:
      https://hdl.handle.net/11693/115942
    • Rights:
      info:eu-repo/semantics/openAccess
    • Accession Number:
      edsbas.FC103C0A