AI Application Engineer

Torrance, California

QUICK USA, Inc.
Apply for this Job

Summary

A growing startup IT company is seeking a highly skilled AI Application Engineer to join their team. The ideal candidate will play a key role in designing, developing, and deploying API services powered by Large Language Models (LLMs) while integrating seamlessly with existing TypeScript-based backend systems.


Essential Duties

  • Design, develop, and deploy a Python-based API server with LLM
  • Build API endpoints using FastAPI
  • Implement LLM workflows using LangChain/LangGraph libraries
  • Develop vector search functionality with Qdrant DB
  • Implement real-time communication with the frontend using WebSockets
  • Effectively integrate with the existing TypeScript backend
  • Optimize LLM functionality for performance and cost
  • Collaborate with team members to improve the overall system

Working Hours, Working style

Monday - Friday; 8 hours a day

Core hours: 9:30 AM - 2:30 PM (Flexible schedule outside core hours)


Working Location

Irvine, CA


Salary/Benefit

$100K - 140K DOE

  • Health insurance
  • Retirement plan (simple IRA)
  • Paid time off (PTO) & sick leave

Holidays

Saturdays, Sundays, and major US holidays


Qualifications

  • Business-level proficiency in English (spoken and written)
  • 5+ years of professional experience in Python development
  • Proven experience building and deploying production-level applications using LLMs (e.g., GPT, Claude, Azure OpenAI, or Google Gemini)
  • Solid understanding of RESTful API design and development
  • Experience integrating AI features into real-world applications
  • Experience or interest in the following technologies: FastAPI, LangChain / LangGraphQdrant, or other vector databases. WebSocket communication, Basic knowledge of TypeScript and React for integration with frontend systems
  • Familiarity with prompt engineering, fine-tuning, or retrieval-augmented generation (RAG)Hands-on experience with OpenAI, Anthropic, or other model APIs in scalable environments
  • Awareness of AI safety, latency management, and cost optimization strategies
Date Posted: 02 May 2025
Apply for this Job