In vitro fertilization (IVF) has emerged as a transformative solution for infertility. However, achieving favorable live-birth outcomes remains challenging. Current clinical IVF practices in IVF involve the collection of heterogeneous embryo data through diverse methods, including static images and temporal videos. However, traditional embryo selection methods, primarily reliant on visual inspection of morphology, exhibit variability and are contingent on the experience of practitioners. Therefore, an automated system that can evaluate heterogeneous embryo data to predict the final outcomes of live births is highly desirable.