HTML to Plain Text: Strip Tags, Preserve Structure, and Handle Entities