Cross-layer efforts for energy-efficient computing: towards peta operations per second perwatt