January 9, 2024

Author(s)
Sonia Buckley, Adam McCaughan, Bakhrom Oripov

Training a machine learning model necessarily involves more operations than inference alone, and it demands higher precision, more memory, and greater computational complexity. In hardware, many implementations side-step this issue by designing "inference-only" hardware