
List:       subversion-issues
Subject:    [Issue 4064] New - UTF-8 files that only contain a byte order mark
From:       fabianschmied () tigris ! org
Date:       2011-11-18 14:55:13
Message-ID: iz4064 () subversion ! tigris ! org

http://subversion.tigris.org/issues/show_bug.cgi?id=4064
                 Issue #|4064
                 Summary|UTF-8 files that only contain a byte order mark are added as binary
               Component|subversion
                 Version|all
                Platform|All
                     URL|http://svn.haxx.se/dev/archive-2011-10/0331.shtml
              OS/Version|All
                  Status|NEW
       Status whiteboard|
                Keywords|
              Resolution|
              Issue type|DEFECT
                Priority|P3
            Subcomponent|cmdline client
             Assigned to|stsp
             Reported by|fabianschmied

------- Additional comments from fabianschmied@tigris.org Fri Nov 18 06:55:12 -0800 2011 -------
When adding a file that is empty except for a UTF-8 byte order mark to an SVN 
repository, that file gets assigned an svn:mime-type property of 
application/octet-stream even though it is clearly meant to be a text file. 
This is a problem because some text editors use the UTF-8 byte order mark to 
identify the encoding of a file and thus automatically add that mark even to 
empty files.

One tool combination that triggers this problem is Microsoft Visual Studio 
with the VisualSVN add-in: when Visual Studio creates an empty code file, 
VisualSVN automatically adds the file to SVN, and the code file is thus 
identified as a binary file.

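The reason for this issue is that the svn_io_detect_mimetype2 and 
svn_io_is_binary_data functions have no specific support for detecting the 
UTF-8 byte order mark. The mark consists of the bytes 0xEF 0xBB 0xBF, none of 
which is ASCII, so a file that contains nothing else looks entirely non-textual 
to a byte-based check.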

The solution discussed on the mailing list would be to patch one of these 
functions so that the byte order mark is not interpreted as binary data.
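
As a rough illustration of that idea (not the actual Subversion code), the 
sketch below skips a leading UTF-8 byte order mark before running a binary 
check. All function names are hypothetical, and looks_binary is a simplified 
stand-in heuristic assumed for this example (a NUL byte, or more than 15% of 
bytes outside the usual text range, counts as binary); it is not a copy of 
svn_io_is_binary_data.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

/* The three bytes of the UTF-8 byte order mark. */
static const unsigned char UTF8_BOM[3] = { 0xEF, 0xBB, 0xBF };

/* Return true if the buffer begins with a UTF-8 byte order mark. */
bool starts_with_utf8_bom(const unsigned char *buf, size_t len)
{
    return len >= sizeof(UTF8_BOM)
        && memcmp(buf, UTF8_BOM, sizeof(UTF8_BOM)) == 0;
}

/* Hypothetical, simplified binary heuristic (an assumption for this sketch):
   any NUL byte, or more than 15% "suspicious" bytes, means binary. */
bool looks_binary(const unsigned char *buf, size_t len)
{
    size_t suspicious = 0;
    for (size_t i = 0; i < len; i++) {
        unsigned char c = buf[i];
        if (c == 0)
            return true;
        if (c < 0x07 || (c > 0x0D && c < 0x20) || c > 0x7F)
            suspicious++;
    }
    return len > 0 && suspicious * 100 > len * 15;
}

/* Same check, but a leading UTF-8 BOM is skipped first so that it is
   not counted as binary data. */
bool looks_binary_bom_aware(const unsigned char *buf, size_t len)
{
    assert(buf != NULL || len == 0);
    if (starts_with_utf8_bom(buf, len)) {
        buf += sizeof(UTF8_BOM);
        len -= sizeof(UTF8_BOM);
    }
    return looks_binary(buf, len);
}
```

With this sketch, a buffer holding only the three mark bytes is reported as 
binary by looks_binary but as text by looks_binary_bom_aware, while a buffer 
with a NUL byte after the mark is still reported as binary.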

------------------------------------------------------
http://subversion.tigris.org/ds/viewMessage.do?dsForumId=463&dsMessageId=2880631
