About

About Me

Hi! I’m Ayaan Sharif, an AI Engineer with hands-on experience in developing and deploying large-scale, multimodal AI systems for video, audio, and text analysis. I’m passionate about leveraging cutting-edge AI technologies, including RAG and agentic workflows, to solve complex real-world problems.

Professional Summary

I specialize in building and deploying AI systems that can understand and process multiple types of media simultaneously. My work involves processing extensive datasets (12,000+ movies) and implementing high-throughput inference pipelines on cloud infrastructure using A100/L4 GPUs.

I currently work as a Full Stack AI Engineer at Xfinite Global PLC (erosnow.com), where I develop multimodal AI systems for content analysis and intelligence.

🛠️ Technical Skills

Programming Languages

  • Python (Primary)
  • SQL for data management
  • JavaScript/TypeScript for full-stack development
  • Bash/Shell Scripting for automation

AI/ML Frameworks & Libraries

  • PyTorch, TensorFlow - Deep learning frameworks
  • Hugging Face Transformers - Pre-trained models
  • Scikit-learn - Machine learning
  • OpenCV - Computer vision
  • Pandas, NumPy - Data manipulation
  • LangGraph - Agentic AI workflows

Backend & Full-Stack Development

  • FastAPI - High-performance API development
  • React.js - Frontend development
  • REST APIs - API design and implementation
  • HTML/CSS - Web technologies

Cloud & DevOps

  • Google Cloud Platform (GCP) - Vertex AI
  • Docker - Containerization
  • Git - Version control
  • Linux (Debian) - System administration

Databases & Specialized Tools

  • Redis - Caching and session management
  • Weaviate - Vector database
  • ImageBind - Multimodal embeddings

Specialized Areas

  • Multimodal AI (Video, Audio, Text processing)
  • Retrieval-Augmented Generation (RAG)
  • Agentic AI Workflows
  • Large Language Models (Gemini)
  • High-Throughput Inference Pipelines
  • Video/Audio Processing (Whisper)

💼 Professional Experience

Full Stack AI Engineer

Xfinite Global PLC (erosnow.com) | Mumbai, India (Hybrid)
April 2024 – Present

  • Developed and deployed large-scale multimodal AI systems analyzing video, audio, and text using Gemini, InternVLM, and ImageBind
  • Processed over 12,000 movies for automated scene segmentation, face detection/tracking, event tagging, and compliance analysis
  • Engineered high-throughput inference pipelines on GCP (A100/L4 GPUs) using FastAPI
  • Designed RAG-based workflows for dynamic content summarization, subtitle generation, and semantic movie indexing
  • Contributed to agentic AI pipelines for scalable content intelligence and enhanced user recommendations

🚀 Key Projects

Kaivo AI - EdTech AI Platform

February 2024 – Present

An AI-powered educational platform integrating agentic workflows for academic use cases:

  • Architecting multimodal RAG pipelines leveraging Gemini models via Google AI APIs
  • Developed backend infrastructure using FastAPI and Redis
  • Built responsive frontend with React.js
  • Implementing dynamic content generation and contextual search capabilities

Bellabeat Wellness Data Analysis

October 2023 – December 2023

Data analysis case study on Fitbit usage patterns:

  • Performed exploratory data analysis on wellness data using Python, Pandas, Matplotlib
  • Identified key trends in user activity, sleep patterns, and health metrics
  • Created actionable insights for product improvements and marketing strategies

🎓 Education

B.Tech in AI & Data Science
Mumbai University | Mumbai, India
April 2020 – April 2024

🌟 Interests & Activities

  • Open Source Contributions: Active contributor to AI projects on GitHub and Hugging Face
  • Tech Community: Engaged in AI and data science discussions on Twitter (X) and Discord
  • Linux Enthusiast: Experienced Debian user passionate about system optimization
  • Continuous Learning: Always exploring the latest developments in AI and machine learning

📫 Let’s Connect!

I’m always interested in collaborating on exciting AI projects or discussing new technologies. Feel free to reach out!


I am currently seeking opportunities to apply my expertise in multimodal AI and agentic workflows to challenging real-world problems.