The comm utility in uutils coreutils silently corrupts data by performing lossy UTF-8 conversion on all output lines. The implementation uses String::from_utf8_lossy(), which replaces invalid UTF-8 byte sequences with the Unicode replacement character (U+FFFD). This behavior differs from GNU comm, which processes raw bytes and preserves the original input. This results in corrupted output when the utility is used to compare binary files or files using non-UTF-8 legacy encodings.
Metrics
Affected Vendors & Products
References
History
Wed, 22 Apr 2026 19:15:00 +0000
| Type | Values Removed | Values Added |
|---|---|---|
| Metrics |
ssvc
|
Wed, 22 Apr 2026 16:30:00 +0000
| Type | Values Removed | Values Added |
|---|---|---|
| Description | The comm utility in uutils coreutils silently corrupts data by performing lossy UTF-8 conversion on all output lines. The implementation uses String::from_utf8_lossy(), which replaces invalid UTF-8 byte sequences with the Unicode replacement character (U+FFFD). This behavior differs from GNU comm, which processes raw bytes and preserves the original input. This results in corrupted output when the utility is used to compare binary files or files using non-UTF-8 legacy encodings. | |
| Title | uutils coreutils comm Silent Data Corruption via Lossy UTF-8 Normalization | |
| Weaknesses | CWE-176 | |
| References |
| |
| Metrics |
cvssV3_1
|
Status: PUBLISHED
Assigner: canonical
Published:
Updated: 2026-04-22T18:12:21.735Z
Reserved: 2026-04-02T12:58:56.087Z
Link: CVE-2026-35346
Updated: 2026-04-22T18:12:14.556Z
Status : Awaiting Analysis
Published: 2026-04-22T17:16:36.760
Modified: 2026-04-22T21:23:52.620
Link: CVE-2026-35346
No data.
OpenCVE Enrichment
No data.