Paypoint Nepal

AI Engineer

PayPoint is an innovative and reliable FinTech company focused on transforming the future of online and offline electronic payments through a range of instruments, e.g. bank cards, prepaid cards, e-money, mobile wallets, and airtime.

We are committed to delivering excellence through industry-leading, secure, scalable, and reliable solutions for telecom companies, airtime providers, financial institutions, utility companies, retailers, e-commerce providers, and others.

We operate the full spectrum of currently available technologies to deliver services through various channels, such as self-service kiosks, POS devices, Android and iOS apps, web interfaces, APIs, SMS and USSD messaging, QR codes, and more.

Basic Job Information

Job Category : IT & Telecommunication
Job Level : Mid Level
No. of Vacancies : 2
Employment Type : Full Time
Job Location : Kathmandu
Offered Salary : Not Disclosed
Apply Before (Deadline) : Jul. 04, 2025 23:55

Job Specification

Education Level : Under Graduate (Bachelor)
Experience Required : More than 3 years
Professional Skills Required : Interpersonal Skills, SQL, PostgreSQL, Database Management, Python
Other Specification
  • Experience: 5+ years for the senior role and 3–5 years for the mid-level role
  • Programming: Python (advanced), FastAPI
  • Databases: SQL, PostgreSQL, Redis
  • LLM & NLP: Hugging Face Transformers, LangChain, LlamaIndex, tool calling, MCP
  • Embedding & RAG: Embedding generation and querying, FAISS, Pinecone, Milvus
  • Model Development: Ollama (mandatory); PyTorch, TensorFlow, scikit-learn (preferred)
  • MLOps: Docker, Git (mandatory); MLflow, Airflow, Prefect (preferred)
  • Security: Pydantic, cryptography, OAuth2, data validation

Soft Skills

  • Clear engineering communication and ability to document technical decisions
  • Data-driven decision-making, testing, version control, and rollback management
  • Mandatory: Technical communication in Russian
  • Required: English at B1–C1 proficiency level

Benefits:

  • Competitive salary package
  • Ongoing training and development opportunities
  • Fun and inclusive work culture
  • Opportunities for career growth and advancement within the company

Job Description

This role focuses on integrating artificial intelligence into our production systems, including payment platforms, fraud prevention solutions, and customer interaction tools. The candidate is expected to solve real-world problems using large language models (LLMs), natural language processing (NLP), and machine learning techniques. Responsibilities include end-to-end development: building models from scratch, optimizing them, turning them into APIs, and integrating them with live systems.

This position requires an engineering mindset that goes beyond research to deliver production-grade solutions. Expectations include defining problems, constructing relevant datasets, training models, integrating outputs into products, and scaling systems sustainably. Every code component deployed to production must meet performance, security, and maintainability standards.

The role also involves developing custom LLM-based solutions such as document comparison, text segmentation, RAG-based search, and chatbot architectures, as well as integrating with vector databases (FAISS, Pinecone) and building multi-agent or rule-based logic systems.
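
As a rough illustration of the RAG-style retrieval step mentioned above, the following minimal sketch ranks stored vectors by cosine similarity to a query vector. It is plain Python and does not use FAISS or Pinecone; the names `cosine_similarity`, `top_k`, and `TOY_INDEX`, and the three-dimensional toy vectors, are assumptions made for this example. In practice the vectors would come from an embedding model and the search from a vector database.

```python
import math
from typing import Dict, List, Tuple


def cosine_similarity(a: List[float], b: List[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector has no direction, so treat it as unrelated
    return dot / (norm_a * norm_b)


def top_k(query: List[float], index: Dict[str, List[float]], k: int = 2) -> List[Tuple[str, float]]:
    """Return the k document ids whose vectors are most similar to the query."""
    scored = [(doc_id, cosine_similarity(query, vec)) for doc_id, vec in index.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Hypothetical toy index: document id -> embedding (made-up values).
TOY_INDEX = {
    "refund_policy": [0.9, 0.1, 0.0],
    "card_limits": [0.1, 0.9, 0.1],
    "kiosk_locations": [0.0, 0.2, 0.9],
}

if __name__ == "__main__":
    for doc_id, score in top_k([1.0, 0.0, 0.0], TOY_INDEX, k=2):
        print(doc_id, round(score, 3))
```

The retrieved documents would then be passed to the LLM as context. A brute-force scan like this only suits small collections, which is one reason approximate-nearest-neighbor indexes are used at scale.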

Core Responsibilities

  • Design LLM-based systems (chatbots, summarizers, classifiers) and apply embedding-based semantic search
  • Prototype LLM solutions rapidly using Ollama in local environments with low-latency testing
  • Build similarity-based retrieval systems using embeddings and vector databases (FAISS, Pinecone)
  • Implement multi-step user interaction flows with Modular Command Processing (MCP)
  • Enable LLMs to interact with external systems via tool calling (e.g., databases, APIs, computation services)
  • Use Python for data processing, model prototyping, and building production-level API services
  • Serve models via microservice architecture using FastAPI, with a focus on security and performance
  • Manage model I/O data using SQL and PostgreSQL, and optimize query performance
  • Use Redis for caching, session management, and fast-access transient model outputs
  • Apply MLOps principles to manage pipelines, fine-tuning, versioning, and monitoring
  • Fully integrate developed systems into production, and follow release and monitoring processes per MLOps standards
  • Align models with business goals and validate with A/B testing
  • Deliver projects end-to-end in direct collaboration with product managers, frontend/backend developers, and data analysts
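
For candidates unfamiliar with the tool-calling responsibility listed above, the sketch below shows one possible shape of the dispatch step: the model emits a JSON object naming a tool and its arguments, and the service validates it and runs the matching Python function. The registry `TOOLS`, the `tool` decorator, the `dispatch` function, the `convert_amount` example tool, and the JSON layout are all hypothetical and chosen for illustration; they do not describe PayPoint's actual systems or any specific LLM framework.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical registry mapping tool names to plain Python callables.
TOOLS: Dict[str, Callable[..., Any]] = {}


def tool(fn: Callable[..., Any]) -> Callable[..., Any]:
    """Register a function under its own name so the model can request it."""
    TOOLS[fn.__name__] = fn
    return fn


@tool
def convert_amount(amount: float, rate: float) -> float:
    """Example tool: convert an amount using a caller-supplied rate."""
    return round(amount * rate, 2)


def dispatch(tool_call_json: str) -> Dict[str, Any]:
    """Parse a model-produced tool call and execute it, never raising."""
    try:
        call = json.loads(tool_call_json)
    except json.JSONDecodeError:
        return {"ok": False, "error": "invalid JSON"}
    if not isinstance(call, dict):
        return {"ok": False, "error": "tool call must be a JSON object"}
    fn = TOOLS.get(call.get("name"))
    if fn is None:
        return {"ok": False, "error": "unknown tool"}
    try:
        result = fn(**call.get("arguments", {}))
    except TypeError:
        # Raised for missing, unexpected, or non-mapping arguments.
        return {"ok": False, "error": "bad arguments"}
    return {"ok": True, "result": result}


if __name__ == "__main__":
    print(dispatch('{"name": "convert_amount", "arguments": {"amount": 100, "rate": 1.6}}'))
```

Returning a structured error instead of raising lets the calling loop feed the failure back to the model. A production service would also validate argument types (for example with Pydantic, listed under Security) before executing anything.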
