Week 06 – Research Paper Study & Adaptation

Dates: 2025-07-08 – 2025-07-14 Internship: AI/ML Intern at SynerSense Pvt. Ltd. Mentor: Praveen Kulkarni Sir


Focus

This week was focused on understanding high-impact research papers related to facial recognition and multimodal learning. The insights helped us benchmark approaches and select the right direction for extending our current work — all within NDA constraints.


Goals for the Week

  • Analyze top papers including DeepFace, CelebA, and MS-Celeb-1M
  • Identify architectural and training strategies from published work
  • Extract reusable code components or libraries
  • Build internal notes for adaptation (non-disclosable externally)

Tasks Completed

Task Status Notes
Reviewed DeepFace CVPR 2014 Paper ✅ Completed Extracted high-level architecture and pre-processing
Summarized CelebA Dataset Usage & Purpose ✅ Completed Documented data splits and key attributes
Reviewed MS-Celeb-1M Large-Scale Benchmark ✅ Completed Noted benchmark criteria, dataset challenges
Matched CLIP/VLM output to internal project pipeline ✅ Completed Used vision encoder outputs in internal test framework
Maintained private internal analysis notes (NDA) ✅ Completed For mentor review, not open-sourced

Key Learnings

  • Paper reading deepens understanding of real-world implementations
  • Face verification success hinges on alignment + representation
  • Vision-language models (like CLIP) can be modularized efficiently
  • Datasets vary in complexity, annotation quality, and usage rights

Problems Faced & Solutions

Problem Solution
No direct code in papers Used matching PyPI libraries (e.g., DeepFace)
Understanding legacy methods Referred to official benchmark sites and repos
Need to protect internal insights Maintained internal-only notebooks (under NDA)

📎 References


Goals for Next Week

  • Integrate internal code with vision encoder embeddings
  • Begin training internal pipeline with updated feature vectors
  • Prepare learnings page with clean summaries (non-sensitive)

“Digging into the foundational papers gave me clarity on the ‘why’ behind popular architectures — this shapes how we build our own smarter, compliant solutions.”