CSV Encoding Issues: UTF-8, BOM, Latin-1, and How to Fix Them
Diagnose and fix CSV encoding problems: BOM markers, mojibake, Windows-1252 vs UTF-8, and how to detect encoding automatically.
Published:
Tags: data, csv, encoding
CSV Encoding Issues: UTF-8, BOM, Latin-1, and How to Fix Them You've received a CSV from a client, a legacy system, or an Excel export. You try to parse it and get: . Or you parse it successfully but the first column header has a prefix you can't delete. Or accented characters like and become garbage like and . These are encoding issues. This guide covers the practical cases: what encoding problems look like, how to detect them, and how to fix them across Python, Node.js, and the browser. -------|--------|-----------| | UTF-8 | Universal | Modern systems, web exports, macOS | | UTF-8 with BOM | Microsoft | Excel "Save As CSV (UTF-8)" | | Latin-1 / ISO-8859-1 | Western European | Legacy systems, older Excel | | Windows-1252 | Microsoft | Windows legacy apps | | UTF-16 | Windows | Rare, but…
All articles · theproductguy.in