Back
Applied AI Engineer - LLM & NLP
Full-time
At Toku, we create bespoke cloud communications and customer engagement solutions to reimagine customer experiences for enterprises. As an Applied AI Engineer, you will focus on building, improving, and deploying real-world AI capabilities across speech-to-text, chatbots, and large language model–driven features used in production. This role combines hands-on model development with applied research, translating research insights into practical improvements in live customer-facing systems.
What You Will Be Doing
Train, fine-tune, evaluate, and improve NLP, speech-to-text, and LLM-based models used in production environments.
Work hands-on with chatbots, summarisation, and language understanding features, including retrieval-augmented generation (RAG) and vector-based retrieval.
Read, assess, and experiment with AI/ML research, translating promising ideas into production-ready solutions.
Integrate models into backend services using Python-based APIs, collaborating closely with backend engineers.
Support investigation and resolution of AI-related production issues.
What We'd Love to See
Strong hands-on experience with LLMs, NLP, or speech technologies in real-world or production contexts.
Practical experience with Python-based AI development (e.g. PyTorch and related ecosystems).
Familiarity with RAG, embeddings, and vector-based retrieval systems.
Working knowledge of AWS-based environments and AI tooling (e.g. EC2, SageMaker, MLflow, Docker).



