The way I like my games is with only one video/pic per page. (There can be additional pictures in the UI, like portraits or backgrounds.) So to me, the best way to have videos on autoplay while making sure only one video plays at a time is simply to have only one video per page. I've played too many games (some made with Ren'Py, so the problem was a bit different, but the lesson is the same) where the devs put in a whole bunch of videos separated by only one short line of dialogue, and sometimes that line is just an awkward interjection ("oooooohhhhhhhh"). And I've played too many games with way too much repetitive, uninteresting text that didn't tell me anything I couldn't already figure out from the videos/images.
One of the most important (even if subjective) criteria of a good game is the text-to-image ratio. My advice: if you limit yourself to one vid per page, you decrease your chances of putting in too many videos and not enough text. I think this tip helps in the QoL department (keeping a clean, enjoyable form) and also in the content department (preventing those extremes of pic/text ratio that really ruin a game). And if you still want to adjust the text/pic ratio after that, you can change the amount of text or the length of the videos.
Another important piece of advice on the subject:
To me, there are 3 important things the player should be able to understand during a sex scene, so 3 things the dev should describe (this is true for any form of art, by the way, not only scenes involving sex):
-What is happening: what exactly are the characters doing? In this case that is mostly delivered by the videos. BUT if you use multiple videos in one scene (and that will often happen), you have to make sure the things that happen BETWEEN the videos are described.
-How the characters are reacting to what is happening. What are they saying? How are their bodies reacting?
-And last but not least, the thing I personally think is lacking the most in most games, and also what excites me the most: what is happening in the characters' heads? That's undoubtedly the hardest part. To me it's important to describe what is going on in the characters' minds, because that is what develops the characters and gives material to your story.
Of course, some of this can be left to the imagination, and the game will probably be even better for it, but pulling that off takes a really good artist. So my advice to a beginner is not to rely on the player's imagination, and to make sure these 3 things are described well enough that the player gets the full picture of the scene.