Optimize, compress, and distill large language and vision models for on-device inference. Build pipelines for distillation and hardware-specific compilation, and benchmark performance across NPU/GPU architectures.
We are looking for an AI Specialist Engineer to improve the performance of large language and vision models for on-device inference. You will develop and deploy these models and ensure they run efficiently across diverse hardware architectures, including NPUs and GPUs.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.

