Debian Bug report logs - #457489
apt: Convert Packages/Source file from utf-8 to codeset.

Package: apt; Maintainer for apt is APT Development Team <[email protected]>; Source for apt is src:apt.

Reported by: Kurt Roeckx <[email protected]>

Date: Sat, 22 Dec 2007 19:42:04 UTC

Severity: wishlist

Tags: patch

Found in version apt/0.7.9

Report forwarded to [email protected], APT Development Team <[email protected]>:
Bug#457489; Package apt.


Acknowledgement sent to Kurt Roeckx <[email protected]>:
New Bug report received and forwarded. Copy sent to APT Development Team <[email protected]>.


Message #5 received at [email protected]:

From: Kurt Roeckx <[email protected]>
To: [email protected]
Subject: apt: Convert Packages/Source file from utf-8 to codeset.
Date: Sat, 22 Dec 2007 20:34:06 +0100
Package: apt
Version: 0.7.9
Severity: wishlist
Tags: patch

Hi,

It seems that apt currently does not convert the Packages or Sources
file to the codeset of the terminal.  This is probably because there
never was, and still isn't, a requirement that the debian/control
file be written in a particular encoding.  However, most packages
do have their control file written so that it is compatible with UTF-8.
Apt also already converts the long description from UTF-8
to the current codeset.

It seems there currently are 11 packages that aren't compatible with
UTF-8:
http://lintian.debian.org/reports/Tdebian-control-file-uses-obsolete-national-encoding.html

There is also a release goal that tries to get rid of those, see
bug #453954.

I've attached a patch that tries to convert the text from UTF-8 to the
currently selected codeset, and returns the original string if the
conversion fails.

One effect this change has is that descriptions for any of those packages
now show up fully instead of being truncated at the first char that
isn't UTF-8.  
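
The convert-or-fall-back behaviour described above can be sketched as
follows.  This is only an illustration in Python, not the attached patch
(apt itself is C++): the function name utf8_to_codeset and the use of
locale.getpreferredencoding() to find the current codeset are assumptions
made for the example.

```python
import locale
from typing import Optional


def utf8_to_codeset(raw: bytes, codeset: Optional[str] = None) -> bytes:
    """Re-encode UTF-8 bytes into `codeset`.

    Hypothetical helper: if the input is not valid UTF-8, or cannot be
    represented in the target codeset, the original bytes are returned
    unchanged instead of being truncated or dropped.
    """
    if codeset is None:
        # Assumption: the "currently selected codeset" is whatever the
        # locale reports as its preferred encoding.
        codeset = locale.getpreferredencoding(False)
    try:
        return raw.decode("utf-8").encode(codeset)
    except (UnicodeError, LookupError):
        # UnicodeError: invalid UTF-8 input or unrepresentable character.
        # LookupError: the codeset name is unknown to Python.
        return raw


if __name__ == "__main__":
    # Valid UTF-8 is converted to the target codeset.
    print(utf8_to_codeset("café".encode("utf-8"), "latin-1"))
    # A Latin-1 byte string is not valid UTF-8, so it comes back as-is.
    print(utf8_to_codeset(b"caf\xe9", "latin-1"))
```

Returning the input untouched on failure means a description in an old
national encoding is passed through whole rather than cut off at the
first offending byte.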

I might not have made the changes in the right place, because I'm not
familiar with the code.  But the changes work for me.


Kurt

[utf8.diff (text/x-diff, attachment)]

Information forwarded to [email protected], APT Development Team <[email protected]>:
Bug#457489; Package apt.


Acknowledgement sent to Kurt Roeckx <[email protected]>:
Extra info received and forwarded to list. Copy sent to APT Development Team <[email protected]>.


Message #10 received at [email protected]:

From: Kurt Roeckx <[email protected]>
To: [email protected]
Subject: Re: apt: Convert Packages/Source file from utf-8 to codeset.
Date: Sun, 2 Mar 2008 15:42:06 +0100
Hi,

Can someone please comment on this patch and maybe apply it?

We're now in a state where all control files are in UTF-8.


Kurt






Debian bug tracking system administrator <[email protected]>. Last modified: Sun Sep 22 07:24:43 2024; Machine Name: bembo
