Large Language Model (LLM) Expertise
- Strong understanding of modern LLM architectures (e.g., Transformer-based models such as GPT, BERT, and LLaMA).
- Experience working with pretrained models and fine-tuning them for downstream tasks.
- Familiarity with inference optimization, model evaluation, prompt engineering, and deployment considerations.
- Knowledge of vector databases, embeddings, and retrieval-augmented generation (RAG) is a plus.
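For illustration, here is a minimal sketch of the retrieval step behind RAG, using sentence embeddings and cosine similarity; the model name and toy corpus are placeholders, not a prescribed stack:

```python
# Minimal RAG-style retrieval sketch: embed a corpus, then rank passages
# against a query by cosine similarity. Model and corpus are illustrative.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # small general-purpose embedder

corpus = [
    "LoRA injects low-rank adapter matrices into frozen model weights.",
    "RAG retrieves supporting passages and feeds them to the generator.",
    "Perplexity measures how well a model predicts held-out text.",
]
corpus_emb = model.encode(corpus, normalize_embeddings=True)

def retrieve(query: str, k: int = 2) -> list[str]:
    q_emb = model.encode([query], normalize_embeddings=True)[0]
    scores = corpus_emb @ q_emb  # dot product of unit vectors = cosine similarity
    return [corpus[i] for i in np.argsort(-scores)[:k]]

context = "\n".join(retrieve("What does retrieval-augmented generation do?"))
prompt = f"Context:\n{context}\n\nQuestion: What is RAG?\nAnswer:"
```

A production system would swap the in-memory dot product for a vector database, but the retrieval contract is the same.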
Training Data & Dataset Engineering
- Solid understanding of dataset curation, preprocessing, augmentation, and quality control.
- Experience working with large-scale text datasets, dataset formatting (JSONL, Parquet, etc.), and annotation workflows.
- Ability to design training/validation splits, identify and mitigate data bias, and work with synthetic data generation pipelines.
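As a concrete example of the dataset work described above, a reproducible JSONL load and train/validation split might look like this (the file path is a placeholder):

```python
# Reproducible train/validation split over a JSONL dataset.
# The path "data/corpus.jsonl" is illustrative.
import json
import random

def load_jsonl(path: str) -> list[dict]:
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]

def train_val_split(records: list[dict], val_fraction: float = 0.1, seed: int = 42):
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(len(shuffled) * (1 - val_fraction))
    return shuffled[:cut], shuffled[cut:]

train, val = train_val_split(load_jsonl("data/corpus.jsonl"))
print(f"{len(train)} train / {len(val)} validation examples")
```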
Python & ML Tooling
- Proficiency in Python, with hands-on experience in NLP and ML frameworks such as PyTorch, TensorFlow, Hugging Face Transformers, spaCy, or NLTK.
- Ability to write modular, efficient, and production-ready code for training, evaluation, and experimentation.
- Familiarity with MLOps, experiment tracking (Weights & Biases, MLflow), and data pipelines is advantageous.
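To make the experiment-tracking expectation concrete, a minimal MLflow sketch (parameter and metric values are dummies):

```python
# Minimal experiment-tracking sketch with MLflow; values are dummies.
# By default this logs to a local ./mlruns directory.
import mlflow

with mlflow.start_run(run_name="demo-run"):
    mlflow.log_param("learning_rate", 3e-4)
    for step in range(3):
        mlflow.log_metric("train_loss", 1.0 / (step + 1), step=step)
```

Weights & Biases follows the same pattern with wandb.init() and wandb.log().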
NLP & ML Knowledge
- Understanding of classical NLP methods (tokenization, sequence labeling, text classification, embeddings).
- Knowledge of evaluation metrics for NLP tasks (BLEU, ROUGE, perplexity, accuracy, F1).
- Familiarity with fine-tuning techniques such as LoRA and other parameter-efficient fine-tuning (PEFT) methods, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF) is desirable.
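For instance, attaching LoRA adapters with the Hugging Face PEFT library takes only a few lines; "gpt2" and its "c_attn" attention projection are used purely as a small, runnable example:

```python
# Sketch: wrap a pretrained causal LM with LoRA adapters via PEFT.
# Only the low-rank adapter weights are trained; the base model stays frozen.
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("gpt2")
config = LoraConfig(
    r=8,                        # rank of the low-rank update
    lora_alpha=16,              # scaling applied to the update
    target_modules=["c_attn"],  # GPT-2's fused attention projection
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(base, config)
model.print_trainable_parameters()  # a small fraction of total parameters
```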
Problem-Solving & Analytical Abilities
- Strong analytical mindset with the ability to break down complex NLP challenges.
- Excellent debugging skills across model behavior, training pipelines, and data issues.
- Ability to design experiments, interpret results, and iterate quickly.