➀ Proxy is an AI-vision-controlled robot for long-distance video conferencing that makes video calls more dynamic and interactive. Gesture recognition gives the user hands-free control. ➁ The robot can be controlled in real time from anywhere, adding a new dimension to video conferencing. ➂ The prototype, built using common household items, addresses challenges such as stability, agility, and network connectivity. ➃ Proxy's AI vision control mechanism uses TensorFlow and MoveNet to detect poses and translate them into control signals.
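
To make the pose-to-control step concrete, here is a minimal sketch of how detected keypoints might be turned into a command. It assumes MoveNet's standard output of 17 keypoints, each given as (y, x, score) in normalized image coordinates. The gesture vocabulary (raised wrists), the command names, the `pose_to_command` function, and the 0.3 confidence threshold are all hypothetical choices for illustration, not Proxy's actual mapping.

```python
# MoveNet keypoint order (17 COCO-style keypoints). Each keypoint is
# (y, x, score): y and x are normalized to [0, 1], and y grows downward.
KEYPOINT_NAMES = [
    "nose", "left_eye", "right_eye", "left_ear", "right_ear",
    "left_shoulder", "right_shoulder", "left_elbow", "right_elbow",
    "left_wrist", "right_wrist", "left_hip", "right_hip",
    "left_knee", "right_knee", "left_ankle", "right_ankle",
]
IDX = {name: i for i, name in enumerate(KEYPOINT_NAMES)}

# Assumed confidence threshold; keypoints scoring below it are ignored.
MIN_SCORE = 0.3


def pose_to_command(keypoints, min_score=MIN_SCORE):
    """Map one pose (17 rows of [y, x, score]) to a hypothetical command.

    Illustrative gesture vocabulary:
      both wrists above shoulders -> "forward"
      only left wrist raised      -> "turn_left"
      only right wrist raised     -> "turn_right"
      anything else               -> "stop"
    """
    def raised(wrist, shoulder):
        wrist_y, _, wrist_score = keypoints[IDX[wrist]]
        shoulder_y, _, shoulder_score = keypoints[IDX[shoulder]]
        confident = wrist_score >= min_score and shoulder_score >= min_score
        # Smaller y means higher in the image.
        return confident and wrist_y < shoulder_y

    left = raised("left_wrist", "left_shoulder")
    right = raised("right_wrist", "right_shoulder")
    if left and right:
        return "forward"
    if left:
        return "turn_left"
    if right:
        return "turn_right"
    return "stop"
```

In a real pipeline, the `keypoints` argument would come from the MoveNet model's output tensor (shape `[1, 1, 17, 3]`, reduced to its 17 rows), and the returned command would be sent over the network to the robot. Defaulting to "stop" when keypoints are low-confidence is a deliberate safety choice in this sketch: the robot halts whenever the pose is unclear.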