-
Notifications
You must be signed in to change notification settings - Fork 6
Open
Description
In DiverseVul, there are over 1,000 samples with func
fields that end with CR (\r
) characters. This is unexpected because in most operating systems, line breaks are CRLF or LF (ending with \n
). Although the line break sequences do not impact the actual content, it may cause data inconsistency when using Pandas to save/load these uncommon strings. Please refer to pandas-dev/pandas#10018
about this bug.
A simple solution would be to remove the trailing whitespaces from the func
fields. By doing so, we can get rid of this error without altering the dataset's content.
Metadata
Metadata
Assignees
Labels
No labels