A fast-growing semiconductor startup is developing a next-generation AI acceleration platform focused on dramatically improving efficiency for modern machine learning workloads, particularly large-scale inference. Define and architect end-to-end compute systems for AI/ML workloads, including hardware requirements and system-level topology. Optimize system performance by balancing compute, memory bandwidth, and interconnect efficiency to improve throughput, latency, and power. Collaborate with software, compiler, and ML teams to enable scalable deployment of AI models through effective abstractions and tooling. Stay current with advancements in AI models, compute architectures, and emerging optimization techniques.
Create an account to see the full posting, access our search engine, and more.You're just 60 seconds away from your new Creativeloft account.