Abstract: We propose a photonic computing core that supports simultaneous parallel computing of two independent matrix-vector multiplications, with an ultra-compact ...
The UC Berkeley crew has now shown the value of AI-based optimization by having OpenEvolve find a more efficient approach to load balancing across GPUs handling LLM inference.