[prev in list] [next in list] [prev in thread] [next in thread]
List: kde-core-devel
Subject: Re: Review Request: Add more date formatting options
From: "John Layt" <johnlayt () googlemail ! com>
Date: 2009-11-21 2:02:53
Message-ID: 20091121020253.1728.76630 () localhost
[Download RAW message or body]
> On 2009-11-15 15:14:38, Albert Astals Cid wrote:
> > In my opinion, this has a problem and it is that it encourages developers to do \
> > things like
> > QString myDateString = KGlobal::locale()->calendar()->dateFormat( myDate, "%m %d" \
> > );
> > Which has internationalization problems since %m %d is obviously the wrong format \
> > for at least half of the world that uses "%d %m" so i'd like to ask if we really \
> > need that much flexibility and if we do i'd ask you to empathize in documentation \
> > that the code should be something like
> > QString myDateString = KGlobal::locale()->calendar()->dateFormat( myDate, \
> > i18nc("This is a string that defines how date is shown when doing foo in bar \
> > program check with someurl what is the valid syntax", "%m %d") );
>
> John Layt wrote:
> There is doco already in the apidox saying to i18nc() the format string and giving \
> example code, but I'll move it up to the top to make it more prominent.
> There's already plenty of places where people do already do their own format \
> strings by copying the global KLocale and calling setDateFormat(), so it is \
> something that apps do need, and it's better we provide them with a way of doing it \
> that makes it easier for them to get it right, rather then leave it open for them \
> to make a mistake and possibly mess up the global format. Also language is not \
> always a good heuristic for date formats, take for example someone living in the \
> USA but using Spanish language, which way round should it go then? (which I realise \
> is as good an argument against this change being public as for it).
> Putting the method in KCalendarSystem rather than KLocale makes it harder to find \
> but still there for those who really need it.
> If people feel it will be abused, then I can make it private so the only formats \
> available will be the predefined enum KLocale::DateFormat which currently are the \
> ShortDate and LongDate forms set in l10n/KCM. I can then add to the enum some \
> other standard formats like IsoWeekDate and DayMonthOnly that can be passed into \
> formatDate() and readDate(), although this needs more thought and may require \
> separate input and output format enums (all this is already in the 4.4 feature plan \
> but may be more realistic for 4.5). How user maintainable those would be is open \
> to debate, users would need to be able to set the month/day order on some of them \
> at least.
> John Layt wrote:
> P.S. I think I would be more open to readDate() being private only, it's more open \
> to error.
> Albert Astals Cid wrote:
> I agree the language is not the perfect heuristic to get the correct ordering but \
> better than always showing it wrong, right?
> In my opinion enums would be the best solution, Chusslove had patch that did that \
> exactly but don't remember why he stopped pushing for it.
> John Layt wrote:
> OK, I'll make them protected for 4.4 and see what I can do about some enums before \
> the freeze, but that's more likely to come in 4.5. Hmm, that changes my patch for \
> the Plasma clock, I'll have to go back to the old way.
> Actually, I can't make the readDate() protected, it's been public in both KLocale \
> and KCalendarSystem since 4.0, it's only formatDate() that I introduced as new in \
> 4.4.
> John Layt wrote:
> More thoughts after looking at how more apps use date formatting. I'm now inclined \
> to make this public in 4.4, as I think we are making our app developers jump \
> through hoops for no good reason, and they're making mistakes in the process. \
> Where apps do need one-off date formats they should be able to generate them as \
> required in a controlled and standardised way rather than resorting to hacks, \
> workarounds, or their own implementations that may not localise the calendar system \
> or digit set properly.
> In addition to people doing the copy locale trick described above, and a couple of \
> places where apps temporarily change the global locale format (!), there are a lot \
> of places using the string functions to do things like:
> *header = i18nc("Month String - Year String", "%1 %2",
> KGlobal::locale()->calendar()->monthName(album->date(), KCalendarSystem::LongName),
> KGlobal::locale()->calendar()->yearString(album->date()));
>
> And even more complex translation puzzles with 4 or 5 variables and literal \
> characters.
> So people are in effect doing their own date format strings just the long way \
> around, and almost all are being translated already. I don't see much wrong with \
> giving them an easier way to do it that will also be easier and more consistent for \
> the translators to follow and convert to their locally preferred format including \
> literals, e.g. for the above (and the context is almost un-needed now):
> *header = KGlobal::locale()->calendar()->formatDate( album->date()
> i18nc("Month String - Year String", "%B %Y") );
>
> The kdepim guys have even implemented a whole class in KMime::DateFormatter to do \
> date formatting for them using a mixture of named formats in an enum and custom \
> formatting using QDate and even calling strftime() directly (so not KDE localised \
> which is correct for RFC formats, but not if used for any display formats where \
> they could get the wrong calendar system or digit set).
> A couple of plasmoids have also fully re-implemented the formatting standard as a \
> wrapper around KCalendarSystem for their own purposes.
> Thinking about using enums instead, I've been thinking what ones would be needed \
> for the Plasma Clock, which is configurable to optionally display various date \
> portions, so would need at least the following enums:
> DayShortNumber_MonthShortName
> DayShortNumber_MonthShortName_YearNumber
> WeekdayShortName_DayShortNumber_MonthShortName
> WeekdayShortName_DayShortNumber_MonthShortName_YearLongNumber
> WeekdayName
>
> And that's just the short names, other apps will need long name versions, some \
> might want literal chars included, and the list would just get longer and longer as \
> each app adds what they want even if no-one else needs them. Then there's problems \
> around some of those formats not being valid for use with readDate(), so separate \
> enums would be required for read and display, so the numbers double again... \
> Really, it would just end up as a long list of 'approved' date format strings, just \
> with long and unwieldy names people won't want to use, even if they are \
> consistently translated across all of KDE..
> No, I think the global enum should be restricted to recognised and unambiguous \
> standard formats like IsoDate, IsoWeekDate, IsoOrdinalDate, and the various RFC \
> formats.
> To encourage consistency, we could also point people in the apidox to a TechBase \
> page of standard i18nc() calls with standardised contexts and format strings?
> Albert Astals Cid wrote:
> Ok, if i can't convince you about the enums i don't have much more to say, other \
> than good luck trying to fix the mess :D
OK, one more crack at a compromise.
I tried adding a bunch more enums to DateFormat for all the variations people might \
want and I came up with about 20 or so I'd find useful, but the names are just so \
ugly and inflexible and it's bound to miss a format someone wants, it seemed an abuse \
of the purpose of that enum, and didn't really address the root of the problem that \
date order is usually locale based not language based. It doesn't seem very useful \
to me, more like a straightjacket for coders and not very KDE.
Next I tried going the Flags route. This was a new enum DateFormatOption with 16 \
flag values for each of the possible components, e.g. YearLong, YearShort, \
WeekdayLong, WeekdayShort, etc. Problem with this is the there would have been about \
2430 valid combinations (and that's before seperators and order are considered), \
which is fine if generating it via code, but I would have had to type each one out \
for translation, which is just too many. Besides, I couldn't see why a coder would \
find it easier to type "YearLong && MonthNameLong && DayLong && SeparatorSpaces && \
IsoOrder" rather than "%Y %B %d", even if they did get the i18n() for free.
The final option I've tried is a mixture of extending the current DateFormat enum and \
making a flags DateFormatOptions similar to the TimeFormatOptions flags, while also \
adding a DateFormatType enum as well:
/**
* Format for date string.
*
* Some DateFomats can be modifed using DateFormatOptions. The locale
* formats and ISO standard formats are not modified by the options.
*/
enum DateFormat {
ShortDate, /**< Locale Short date format, e.g. 08-04-2007 */
LongDate, /**< Locale Long date format, e.g. Sunday 08 April 2007 */
FancyShortDate, /**< Same as ShortDate for dates a week or more ago. For \
more
recent dates, it is represented as Today, Yesterday, \
or the weekday name. */
FancyLongDate, /**< Same as LongDate for dates a week or more ago. For \
more
recent dates, it is represented as Today, Yesterday, \
or the weekday name. */
IsoDate, /**< ISO-8601 Date format YYYY-MM-DD, e.g. 2009-12-31 */
IsoWeekDate, /**< ISO-8601 Week Date format YYYY-Www-D, e.g. 2009-W01-1 \
*/
IsoOrdinalDate, /**< ISO-8601 Ordinal Date format YYYY-DDD, e.g. 2009-001 \
*/ SimpleDate, /**< Date with short day and month but no year,
in the locale DateFormatType,
e.g. 31/1 or 1/31 */
AbbreviatedDate, /**< Date with short day, month and year
in the locale DateFormatType,
e.g. 31/1/9 or 1/31/9 */
StandardDate, /**< Date with long day, month and year
in the locale DateFormatType,
e.g. 31/01/2009 or 01/31/2009 */
TruncatedDate, /**< Date with long month and year
in the locale DateFormatType,
e.g. 01/2009 */
};
/**
* @since 4.4
*
* Options for modifying the DateFormat, not all formats and options are
* compatible.
*/
enum DateFormatOption {
DateWithShortComponants = 0x01, /**< Format with numbers and names in Short \
format, e.g. 1 or Jan */
DateWithLongComponants = 0x02, /**< Format with numbers and names in Long \
format, e.g. 01 or January */
DateWithMonthNames = 0x04, /**< Format with Month Name instead of number \
*/
DateWithWeekdayNames = 0x08 /**< Format with Weekday Name prefixed */
};
Q_DECLARE_FLAGS(DateFormatOptions, DateFormatOption)
/**
* @since 4.4
*
* Preferred format type for a date, in particular the order for Year,
* Months and Day, and the default separators used. These defaults may be
* overriden by the translators.
*/
enum DateFormatType {
InternationalFormat, /**< Order as Day Month Year
Separate Numeric format with slashes
Separate Name format with spaces */
AmericanFormat, /**< Order as Month Day Year
Separate Numeric format with slashes
Separate Name format with spaces */
IsoFormat /**< Order as Year Month Day
All components in long form
Separate Numeric format with dashes
Separate Name format with spaces */
};
DateFormat gets 4 new formats called Simple, Abbreviated, Standard, and Truncated. \
These can be modified by the DateFormatOptions to have weekday names, month names, \
and short or long format components. They can also be modified by the DateFormatType \
which primarially controls the order and separators, i.e. American or Normal order. \
The DateFormatType will be set in KLocale where it will default to \
InternationalFormat, and be overridden in the us.desktop locale file to \
AmericanFormat. This gives about 87 valid combinations to code and translate, a more \
manageable number while giving simplicity and flexibility for coders.
New API calls are required to support this:
QString formatDate( const QDate &fromDate,
KLocale::DateFormat toFormat,
KLocale::DateFormatOptions formatOptions ) const;
QString formatDate( const QDate &fromDate,
KLocale::DateFormat toFormat,
KLocale::DateFormatOptions formatOptions,
KLocale::DateFormatType formatType ) const;
So most of the time the app coder will only have to write something like the \
following without having to worry about translation or order or anything:
dateString = KGlobal::locale()->calendar()->formatDate(myDate, \
KLocale::SimpleDate);
or to modify the format slightly:
dateString = KGlobal::locale()->calendar()->formatDate(myDate,
KLocale::SimpleDate,
\
KLocale::DateWithWeekdayNames);
of if they really need a particular order:
dateString = KGlobal::locale()->calendar()->formatDate(myDate,
KLocale::SimpleDate,
\
KLocale::DateWithWeekdayNames,
KLocale::AmericanFormat);
I think that's more like what you would like to see? I like that it is easier for \
the coder not having to worry about translations, and gives priority for date order \
to the locale over the language but still allows the language to override if needed. \
However behind the scenes in formatDate() there would be a lot of stuff like:
if ( toFormat == KLocale::SimpleDate ) {
if ( dateOptions & KLocale::DateWithWeekdayNames &&
dateOptions & KLocale::DateWithMonthNames &&
dateOptions & KLocale::DateWithLongComponants ) {
if ( locale()->dateFormatOrder() == KLocale::InternationalFormat ) {
formatString = i18nc("(kdedt-format) SimpleDate \
InternationalFormat", "%a %d %B");
} else if ( locale()->dateFormatOrder() == KLocale::AmericanFormat ) {
formatString = i18nc("(kdedt-format) SimpleDate AmericanFormat", "%a \
%B %d");
} else if ( locale()->dateFormatOrder() == KLocale::IsoFormat ) {
formatString = i18nc("(kdedt-format) SimpleDate IsoOrder", "%a %B \
%d"); }
} else if ( dateOptions & KLocale::DateWithWeekdayNames &&
...
I haven't fully coded this yet, as I don't want to type 87 date strings and their \
unit tests if there's still changes needed. Any suggestions for a better mixture of \
DateFormat and DateFormatOptions?
One thought is if we add more DateFormatTypes to allow for each possible set of \
formatting rules, then we may not need to translate the format string at all as we \
would have all the info we need about separators and order and length to generate the \
right format strings in code to start with? e.g. add ones like EuropeanFormat for \
dd.mm.yyyy, ChineseFormat, JapaneseFormat, and a LanguageFormat one that uses a \
translated version for when different languages in a locale do use different formats \
(e.g. India), etc. That could be a lot less code and translations but would need a \
lot more planning.
Still, it seems like a lot of work for something that won't be used that often? I \
would still leave the full version public anyway for those very rare occasions where \
it really is needed (kdepim, date widgets), just with very big warnings and sample \
code to dissuade people from using it.
- John
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
http://reviewboard.kde.org/r/2177/#review3100
-----------------------------------------------------------
On 2009-11-15 02:21:15, John Layt wrote:
>
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> http://reviewboard.kde.org/r/2177/
> -----------------------------------------------------------
>
> (Updated 2009-11-15 02:21:15)
>
>
> Review request for kdelibs.
>
>
> Summary
> -------
>
> This change adds a number of new options for date formatting.
>
> 1) New versions of formatDate() and readDate() have already been added to \
> KCalendarSystem in 4.4 to accept a date format string of the coders choice rather \
> than the locale defaults. This saves coders writing something like:
> KLocale *myLocale = new KLocale( *KGlobal::locale() );
> myLocale->setDateFormatShort( "%m %d" );
> QString myDateString = myLocal->dateFormat( myDate, KLocale::ShortDate );
>
> They can now write:
>
> QString myDateString = KGlobal::locale()->calendar()->dateFormat( myDate, "%m %d" \
> );
> This change adds new format codes to these methods to bring them into line with the \
> C / POSIX / CU / GNU standards:
> The following codes are added to formatDate():
>
> %C the 'century' portion of the year
> %j the day of the year number to three digits (e.g. "001" for 1 Jan)
> %V the ISO week of the year number to two digits
> %G the year number in long form of the ISO week of the year
> %g the year number in short form of the ISO week of the year
> %u the day of the week number (e.g. "1" for Monday)
> %D the US short date format (e.g. "%m/%d/%y")
> %F the ISO short date format (e.g. "%Y-%m-%d")
> %x the KDE locale short date format
> %t a tab character
>
> formatDate() also adds the ability to define the case modifier, padding character, \
> and padding width as defined in the GNU version of the standard, e.g. "%05Y" = \
> "02009". See the comprehensive apidox documentation for details.
> There are 2 significant differences between the long-standing KDE implementation \
> and the standard:
> %e in GNU/POSIX is space padded to 2 digits, in KDE is not padded
> %n in GNU/POSIX is newline, in KDE is short month number
>
> Should we fix these to match the standard, or just live with them?
>
> The following codes are added to readDate():
>
> %j the day of the year number
> %V the ISO week of the year number
> %u the day of the week number
>
> Note that only the codes supported by readDate() will be allowed to be used in the \
> locale global date formats, and coders are encouraged to use the locale formats \
> unless really necessary.
> 2) New isValid() and setDate() methods to validate and set the date using either \
> Year and Day of Year, or Year, ISO Week Number, and Day of Week number, as required \
> by the readDate() changes above and as defined in the ISO standard.
> 3) KDE4.3 introduced the ability to use different numeric Digit Sets, e.g. \
> Arabic-Indic or Devenagari instead of Arabic. While some numeric methods in \
> KCalendarSystem have versions to return a string form which returns the correct \
> digit set (e.g. day() and dayString()), many do not have a string version such as \
> weekNumber(). This means a coder needs to know and remember to do the digit set \
> conversion themselves:
> KGlobal::locale()->convertDigits(KGlobal::locale()->calendar()->weekNumber(date), \
> locale()->dateTimeDigitSet());
> This change adds the following convenience methods to improve this:
>
> monthsInYearString()
> weeksInYearString()
> daysInYearString()
> daysInMonthString()
> daysInWeekString()
> dayOfYearString()
> dayOfWeekString()
> weekNumberString()
>
> The existing string methods have all been converted to use the formatDate() method.
>
> 4) A change this big to something so core couldn't be done without significant \
> testing. The first real comprehensive unit tests for formatDate() and readDate() \
> are included. In fact, the test cases took longer to write than the code :-)
> 5) A number of corner-case bugs were uncovered in the existing code by the new test \
> cases which have been fixed as well.
>
> Diffs
> -----
>
> trunk/KDE/kdelibs/kdecore/date/kcalendarsystem.h 1046844
> trunk/KDE/kdelibs/kdecore/date/kcalendarsystem.cpp 1046844
> trunk/KDE/kdelibs/kdecore/date/kcalendarsystemgregorian.cpp 1046844
> trunk/KDE/kdelibs/kdecore/tests/kcalendartest.h 1046844
> trunk/KDE/kdelibs/kdecore/tests/kcalendartest.cpp 1046844
>
> Diff: http://reviewboard.kde.org/r/2177/diff
>
>
> Testing
> -------
>
> Extensive new unit tests written, as attached, which showed up a number of bugs in \
> the existing code. More tests to be added for rest of calendar systems in next few \
> weeks.
>
> Thanks,
>
> John
>
>
[prev in list] [next in list] [prev in thread] [next in thread]
Configure |
About |
News |
Add a list |
Sponsored by KoreLogic