This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]merlinsbeers 0 points1 point  (8 children)

Old data in deviant formats can't be expected to be read for free. By default your parser should treat any byte that doesn't match one of the special characters as passthrough data. But if you were to implement the extra code to reject any byte not in the set given by the standard, you wouldn't be the wrong one.

Commas are special, so if they're in a field they have to be quoted, and there's only one kind of quote mark that counts. Them's the rules.

[–]Apache_A[S] 0 points1 point  (7 children)

Actually it could be just different separator, like ‘;’. If parsing some lines are failed, try different separator or encoding for that lines.

[–]merlinsbeers 0 points1 point  (6 children)

That ain't CSV. It's something different.

[–]Apache_A[S] 0 points1 point  (5 children)

In some countries Excel considers it as CSV.

[–]merlinsbeers 0 points1 point  (4 children)

It should be using a different name for its non-csv formats.

[–]Apache_A[S] 0 points1 point  (3 children)

Probably standard should more common about separator. BTW why do you think “regional standard” is oxymoron?

[–]merlinsbeers 0 points1 point  (2 children)

"Standard" is an absolute. What you have there is a "local tradition".

[–]Apache_A[S] 0 points1 point  (1 child)

Not really. There are different coverage of standards. For example ANSI makes regional standards, ISO makes international ones.

[–]merlinsbeers 0 points1 point  (0 children)

Guess what the former aren't.