all InfoSec news
USENIX ATC '23 - Accelerating Distributed MoE Training and Inference with Lina
Oct. 4, 2023, 11:58 p.m. | USENIX
USENIX www.youtube.com
Jiamin Li, City University of Hong Kong, Yimin Jiang, ByteDance Inc., Yibo Zhu, Unaffiliated, Cong Wang, City University of Hong Kong, Hong Xu, The Chinese University of Hong Kong
Scaling model parameters improves model quality at the price of high computation overhead. Sparsely activated models, usually in the form of Mixture of Experts (MoE) architecture, have sub-linear scaling of computation cost with model size, thus providing opportunities to …
bytedance chinese city distributed hong kong kong quality scaling training university usenix wang
More from www.youtube.com / USENIX
Jobs in InfoSec / Cybersecurity
Social Engineer For Reverse Engineering Exploit Study
@ Independent study | Remote
Cloud Security Analyst
@ Cloud Peritus | Bengaluru, India
Cyber Program Manager - CISO- United States – Remote
@ Stanley Black & Decker | Towson MD USA - 701 E Joppa Rd Bg 700
Network Security Engineer (AEGIS)
@ Peraton | Virginia Beach, VA, United States
SC2022-002065 Cyber Security Incident Responder (NS) - MON 13 May
@ EMW, Inc. | Mons, Wallonia, Belgium
Information Systems Security Engineer
@ Booz Allen Hamilton | USA, GA, Warner Robins (300 Park Pl Dr)