Developed a novel multimodal approach to robotic manipulation by integrating audio cues with visual information to tackle visually challenging scenarios such as occluded scenes and collision detection.

Quick Links: GitHub, Report, Slide Deck