The vast amount of content on social media platforms makes it extremely challenging to get posts noticed. An increasingly popular approach to increase customer engagement relies on text overlays, where text is placed directly on images. Such practices raise questions of how to balance the visual and text elements for the best impact. Three key factors, commonly used by practitioners, can trigger engagement because of their visual salience: the degree of dynamism or implied motion in images as well as the size and centrality of the text overlay. With multiple methods, including field studies, online experiments, and managerial interviews, the authors establish that a text overlay that is too large and placed centrally, combined with dynamic images, has negative effects on consumer engagement, because these design combinations make the post look visually unappealing. They also leverage these findings to develop an interactive app that can help managers compose more engaging multimodal social media posts.