wireservice / csvkit / Issues / #732
Closed
Open
Issue created Dec 28, 2016 by Administrator (@root), Contributor

1.0.0 can no longer parse my utf-8 files

Created by: perbergland

After upgrading from 0.9.1 to 1.0.0, my files can no longer be parsed with csvkit.

The error message is: Your file is not "utf-8" encoded. Please specify the correct encoding with the -e flag. Use the -v flag to see the complete error.

Attaching a sample file - mdfile.md.zip

Commands I ran on the file:

$ cat mdfile.md
login|role|email|note
---|---|---|---
handle1|member|First Last <[email protected]>|My note
handle2|member|First Last <[email protected]>|My note with ööö utf-8
handle3|member|First Läst <[email protected]>|My note
$ file -I mdfile.md
mdfile.md: text/plain; charset=utf-8
$ csvjson -d '|' <mdfile.md
Your file is not "utf-8" encoded. Please specify the correct encoding with the -e flag. Use the -v flag to see the complete error.
[{"login": "---", "role": "---", "email": "---", "note": "---"}, {"login": "handle1", "role": "member", "email": "First Last <[email protected]>", "note": "My note"}, {"login": "handle2", "role": "member", "email": "First Last <[email protected]>", "note": 

environment:

Mac OS X 10.12.2, Python 2.7.12, pip 9.0.1

$ set | grep LC_
local LC_CTYPE=C;
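
The report does not identify a cause. One thing that may be worth checking, offered as an assumption rather than a diagnosis, is the LC_CTYPE=C setting above: under a C locale, Python can report ASCII as its preferred encoding, and bytes that are valid UTF-8 do not decode as ASCII. The sketch below only shows that difference for a row like the one in the sample file; it does not call csvkit, and the helper name `can_decode` is hypothetical.

```python
# -*- coding: utf-8 -*-
import locale

# A row similar to the one in mdfile.md, as raw UTF-8 bytes.
row = u"handle2|member|My note with ööö utf-8".encode("utf-8")


def can_decode(data, encoding):
    """Return True if `data` decodes cleanly with `encoding` (hypothetical helper)."""
    try:
        data.decode(encoding)
        return True
    except UnicodeDecodeError:
        return False


# The preferred encoding depends on the environment (for example LC_CTYPE),
# so no particular value is assumed here.
print(locale.getpreferredencoding())

print(can_decode(row, "utf-8"))  # the bytes are valid UTF-8
print(can_decode(row, "ascii"))  # the non-ASCII bytes are rejected by the ASCII codec
```

If the first line printed is an ASCII variant in the affected shell, comparing the result with LC_CTYPE set to a UTF-8 locale would help confirm or rule out this explanation.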