Week 06 – Research Paper Study & Adaptation
Dates: 2025-07-08 – 2025-07-14 Internship: AI/ML Intern at SynerSense Pvt. Ltd. Mentor: Praveen Kulkarni Sir
Focus
This week was focused on understanding high-impact research papers related to facial recognition and multimodal learning. The insights helped us benchmark approaches and select the right direction for extending our current work — all within NDA constraints.
Goals for the Week
- Analyze top papers including DeepFace, CelebA, and MS-Celeb-1M
- Identify architectural and training strategies from published work
- Extract reusable code components or libraries
- Build internal notes for adaptation (non-disclosable externally)
Tasks Completed
Task | Status | Notes |
---|---|---|
Reviewed DeepFace CVPR 2014 Paper | ✅ Completed | Extracted high-level architecture and pre-processing |
Summarized CelebA Dataset Usage & Purpose | ✅ Completed | Documented data splits and key attributes |
Reviewed MS-Celeb-1M Large-Scale Benchmark | ✅ Completed | Noted benchmark criteria, dataset challenges |
Matched CLIP/VLM output to internal project pipeline | ✅ Completed | Used vision encoder outputs in internal test framework |
Maintained private internal analysis notes (NDA) | ✅ Completed | For mentor review, not open-sourced |
Key Learnings
- Paper reading deepens understanding of real-world implementations
- Face verification success hinges on alignment + representation
- Vision-language models (like CLIP) can be modularized efficiently
- Datasets vary in complexity, annotation quality, and usage rights
Problems Faced & Solutions
Problem | Solution |
---|---|
No direct code in papers | Used matching PyPI libraries (e.g., DeepFace) |
Understanding legacy methods | Referred to official benchmark sites and repos |
Need to protect internal insights | Maintained internal-only notebooks (under NDA) |
📎 References
Goals for Next Week
- Integrate internal code with vision encoder embeddings
- Begin training internal pipeline with updated feature vectors
- Prepare learnings page with clean summaries (non-sensitive)
“Digging into the foundational papers gave me clarity on the ‘why’ behind popular architectures — this shapes how we build our own smarter, compliant solutions.”