Commit Graph

214 Commits

Author SHA1 Message Date
Henri Sivonen
b7043eff75 Test for bug 638318 - Test that a byte run that has more than 1024 zeros but one non-zero byte among them before the 1024 byte limit does not sniff as UTF-16LE/BE. r=bzbarsky, a=test-only. 2011-03-03 19:20:00 +02:00
Henri Sivonen
f68ddc8cba Bug 631751 - Sniff Basic Latin BOMless UTF-16 for IE compat. r=bzbarsky, a=bzbarsky. 2011-02-16 08:40:35 +02:00
Henri Sivonen
bbffca4d43 Bug 620106 - Allow chardet (if enabled) reload the page during load. r=sicking, a=blocking2.0-final. 2011-02-11 12:33:32 +02:00
Benjamin Smedberg
a7e67d32ad Bug 568691 part B - mechanical changes to in-tree binary modules needed to get them building and registering correctly. After this patch, xpcshell appears to work. 2010-06-10 14:11:40 -04:00
timeless@mozdev.org
251ea41baa Bug 552235 - Fixing warning in universalchardet array subscript has type 'char', r=smontagu 2010-03-13 18:16:00 -08:00
tyler.downer@gmail.com
d22af6cdd4 Remove extension/universalchardet/doc/. Bug 471480, r=smontagu 2010-03-13 12:08:57 -08:00
Ms2ger@gmail.com
5024dcc2e9 Bug 548073 - Tabs in extensions/universalchardet/tests/CharsetDetectionTests.js. r=smontagu 2010-03-06 16:36:05 +01:00
Simon Montagu
c38fa14fa2 Set a minimum threshold even for the detector's preferred language. Bug 547487, r=emk 2010-02-23 01:46:15 -08:00
Jonathan Griffin
b10554d803 Bug 542660. Move universalchardet tests that use XPConnect to mochi-chrome. r=smontagu 2010-02-03 13:35:15 -08:00
Mitchell Field
41c8798093 Bug 514665 - fix stupid module name differences on windows. r=ted 2009-09-25 18:41:16 +02:00
Benjamin Smedberg
ba372f3a4c Followup to bug 398573 - remove REQUIRES from the tree since it is no longer used... automatically generated patch, rs=ted 2009-08-25 08:59:31 -07:00
timeless@mozdev.org
6f207b45f2 Bug 493781 Fixing warning in universalchardet array subscript has type 'char'
r=smontagu
2009-06-27 18:42:50 -07:00
Hiroyuki Ikezoe
06a390a40b Bug 488426 - enable TIS-620 detector; r=smontagu 2009-06-24 13:12:36 +09:00
Hiroyuki Ikezoe
77ac7ec630 Bug 482558. Remove obsolete test. r=smontagu 2009-05-18 10:27:40 +12:00
L. David Baron
f0c43ecb3d Switch HTML mochitests from using MochiKit.js to packed.js. (Bug 490955) r=sayrer 2009-05-06 13:46:04 -07:00
Simon Montagu
0ef190bc50 Fix some issues found with valgrind. Bug 479759 and bug 481692. r=Masatoshi Kimura <VYV03354@nifty.ne.jp> 2009-03-10 23:12:52 +02:00
Zeev Tarantov
00b0b9ec63 Make data tables in extensions/universalchardet const. Bug 482227, r=smontagu 2009-03-09 17:20:32 +02:00
Masatoshi Kimura
5517fb7970 Bug 470635 - Remove unused state machines 2008-12-27 19:02:48 +01:00
Masatoshi Kimura
3a3a37f47d Bug 335531 - Correct misuse of UTF-16BE, UTF-16LE, UTF-32BE, and UTF-32LE charset labels; r=smontagu sr=dbaron 2008-12-06 11:08:26 -08:00
Daniel Holbert
61f320aa9d Backed out changeset 78d662c2c878 (Bug 335531) on suspicion of causing mochitest failures in test_bug399284.html on linux & windows unittest boxes. 2008-12-05 11:52:01 -08:00
Masatoshi Kimura
31c5df869e Bug 335531 - Correct misuse of "UTF-16BE", "UTF-16LE", "UTF-32BE", and "UTF-32LE" charset labels; r=smontagu sr=dbaron 2008-12-05 18:31:00 +01:00
Phil Ringnalda
538c9fb42a Bug 461888 - Remove unused PACKAGE_FILE and PACKAGE_VARS and .pkg files, mozilla-central part, r=bsmedberg 2008-11-03 19:46:28 -08:00
Simon Montagu
4fa9ab24c0 Don't require a minimum data threshold for encodings that correspond to the selected language. Bug 431054, r=Masatoshi Kimura (:emk) <VYV03354@nifty.ne.jp> 2008-10-30 10:57:25 -07:00
Neil Rashbrook
898fd2b8d9 Remove <CR>s (to fix cross-compilation bustage) rs=bsmedberg,gavin,roc,smontagu b=372802 2008-09-20 20:45:25 +01:00
Ted Mielczarek
ff7706a544 bug 447642 - get rid of --disable-mochitest, replace MOZ_MOCHITEST with ENABLE_TESTS. r=bsmedberg 2008-09-09 13:59:11 -04:00
Simon Montagu
ca1503f03f Unit test for bug 9357 (Autodetection of EUC-KR) 2008-06-22 01:28:42 -07:00
smontagu@smontagu.org
18cdf56858 Mochitests for charset autodetection 2008-04-21 03:50:45 -07:00
smontagu@smontagu.org
9e9d40c772 Use universal charset detector with filters instead of the old language-specific parallel state machines. Bug 426271, r+sr=dbaron, b=vladimir 2008-04-08 23:36:22 -07:00
smontagu@smontagu.org
2d7e70b9f1 Remove unnecessary and obsolete code. Bug 424916, r+sr=dbaron, a=damons 2008-03-25 13:19:34 -07:00
benjamin@smedbergs.us
baab01ada6 Bug 376636 - Building with gcc 4.3 and -pendatic fails due to extra semicolons, patch by Art Haas <ahaas@airmail.net>, rs=me 2007-04-23 07:21:53 -07:00
hg@mozilla.com
05e5d33a57 Free the (distributed) Lizard! Automatic merge from CVS: Module mozilla: tag HG_REPO_INITIAL_IMPORT at 22 Mar 2007 10:30 PDT, 2007-03-22 10:30:00 -07:00
timeless%mozdev.org
29520925a4 Bug 106386 Correct misspellings in source code
also changing line to reference sequences since you don't count 'apple', you
count 'apples'.
r=glennrp@gmail.com, bclary@bclary.com rs=brendan
2006-11-01 23:04:06 +00:00
smontagu%smontagu.org
3e78aa2e92 Add a minimum data threshold to CharDistributionAnalysis::GetConfidence() to reduce false positives. Bug 306272, r=jshin, sr=rbs 2006-08-08 10:35:30 +00:00
timeless%mozdev.org
3a351ccd58 Bug 328009 UniversalChardetTest.cpp(65) : warning C4800: 'const PRBool' : forcing value to bool 'true' or 'false' (performance warning)
r=smontagu sr=bz
2006-03-21 06:35:45 +00:00
benjamin%smedbergs.us
078e45e203 Bug 78081 - don't export intermediate libraries - initial makefile changes to switch from $(DIST)/lib to in-tree library locations - blanket-r=mento 2006-02-14 18:13:49 +00:00
bsmedberg%covad.net
022df15b54 Bug 320553 - integrate universalchardet into libxul, r=bryner 2005-12-16 20:55:44 +00:00
jgmyers%speakeasy.net
a42bc6dc4f Bug 313211: improve universalchardet test harness r=somntagu 2005-11-02 19:43:32 +00:00
jgmyers%speakeasy.net
fa642069c6 Bug 177505: Misidentified iso-8859-1 text as GB18030 r=smontagu sr=roc 2005-11-02 19:41:13 +00:00
jgmyers%speakeasy.net
ca219654ec Bug 313199: separate out build of universalchardet xpcom wrapper r=benjamin moa=smontagu 2005-11-02 19:38:32 +00:00
bsmedberg%covad.net
42e30e3f4d Beginning to land the PACKAGING_20030906_BRANCH for bug 20640. Not part of the build, yet. 2005-11-02 16:57:30 +00:00
timeless%mozdev.org
7a5b49a926 Bug 86999 - Hebrew support for Universal (All) Autodetect
fixing debug output typo
patch by shoosh20012001@hotmail.com r=smontagu sr=roc a=asa
2005-11-02 16:57:29 +00:00
mozilla.mano%sent.com
25a6e66b15 Bug 304951 - error in chardet's Hebrew language model. patch from Shy Shalom <shoosh20012001@hotmail.com>, r=smontagu, sr=roc. 2005-11-02 16:57:28 +00:00
timeless%mozdev.org
f372746c7b Bug 86999 Hebrew support for Universal (All) Autodetect
patch by shoosh20012001@hotmail.com r=smontagu r=timeless sr=roc a=asa
HebrewProber written by shoosh20012001@hotmail.com
LangHebrewModel written by smontagu
some data compiled by xslf@xslf.com
2005-11-02 16:57:27 +00:00
jshin%mailaps.org
f7ee2f6f5d separate xpcom wrapper and charset detectors : patch by shanjian with some updates by me (r=smontagu, sr=alecf) 2005-11-02 16:57:26 +00:00
timeless%mozdev.org
37aaa287c3 Bug 228545 Bug "FilterWith..." in universal charset detector
patch by vladimir@lukianov.name r=smontagu sr=roc
2005-11-02 16:57:25 +00:00
timeless%mozdev.org
0828ff3e5a Bug 248307 lack of {} in mozilla\extensions\universalchardet\src\nsEUCJPProber.cpp:80
patch by vladimir@lukianov.name r=roc sr=roc
2005-11-02 16:57:24 +00:00
timeless%mozdev.org
504c031dee Bug 225994 DRefTool analysis for universalchardet files
r=smontagu sr=roc
2005-11-02 16:57:23 +00:00
timeless%mozdev.org
e08f95f026 Bug 228542 Divide by zero bug in Universal charset detecter
patch by vladimir@lukianov.name r=smontagu sr=roc
2005-11-02 16:57:22 +00:00
gerv%gerv.net
e7fa0aa43b Bug 236613: change to MPL/LGPL/GPL tri-license. 2005-11-02 16:57:21 +00:00
jgmyers%speakeasy.net
56ac4ee4f8 Accept halfwidth katakana in iso-2022-jp: bug 172393 r=ftang sr=roc 2005-11-02 16:57:19 +00:00
bzbarsky%mit.edu
6d83d3d0be Make frequence tables in universal charset decoder const to move them into the
readonly segment.  Bug 201361, patch by ataylor@its.to (Andrew Taylor),
r=smontagu, sr=alecf
2005-11-02 16:57:18 +00:00
shanjian%netscape.com
09397bd781 #92806 need to support GB18030 in universal detector
replace gb18030 state machine with the new one, which has been tested in
PSM detector.
r=ftang, sr=scc
2005-11-02 16:57:17 +00:00
shanjian%netscape.com
af6d628bba #111340 Autodetect All detect SJIS as Greek
r=yokoyama, sr=brendan
2005-11-02 16:57:16 +00:00
bryner%brianryner.com
16ccb6e2bf Fix build on gcc 3.4 by removing extra semicolons (bug 218551). r/sr=dbaron, a=brendan. 2005-11-02 16:57:15 +00:00
jgmyers%netscape.com
e583433df2 fix uninitialized member: bug 222182 r=sli0262 sr=blizzard 2005-11-02 16:57:14 +00:00
shanjian%netscape.com
3f7e5bdeed #115114 autodetect universal detects french as Central European (ISO-8859-2)
remove 2 hungarian probers.
r=ftang, sr=scc
2005-11-02 16:57:13 +00:00
jaggernaut%netscape.com
15cff7dc41 Bug 25886: Get rid of a bunch of unused NS_DEFINE_IID, NS_DEFINE_CID. r=caillon, sr=dbaron 2005-11-02 16:57:12 +00:00
shanjian%netscape.com
7c5f2f737a #171813 Universal auto detector doesn't work well on sohu news page
adjust the detecting parameter.
r=ftang, sr=jst
2005-11-02 16:57:11 +00:00
dbaron%dbaron.org
3b39e64909 Fix build warnings by using character literals. b=183354 r=smontagu sr=bzbarsky 2005-11-02 16:57:10 +00:00
yokoyama%netscape.com
34ecf37cc7 162894
Universal Auto Detector regression fix.
/r=yokoyama, shanjian, /sr=jst, /a=asa
2005-11-02 16:57:09 +00:00
cbiesinger%web.de
68332f717f 183354 patch by alexey@optus.net (Alexey Chernyak) r=ftang sr=blizzard Make Universal Charset Autodetector recognise UTF by BOM 2005-11-02 16:57:08 +00:00
dbaron%dbaron.org
89f7dc024a Bug 178643: Remove uses of NS_INIT_ISUPPORTS, since it is no longer needed. r=timeless sr=jag 2005-11-02 16:57:07 +00:00
shanjian%netscape.com
f062e7c7ba #162894 Extend universal detector's coverage to include iso-8859-1
Added latin1 prober.
r=smontagu, sr=jst, a=asa
2005-11-02 16:57:06 +00:00
dougt%netscape.com
2adb8cf305 166917. Clean up xpcom SDK includes. r=rpotts@netscape.com, sr=alecf@netscape.com, a=rjesup@wgate.com 2005-11-02 16:57:05 +00:00
shanjian%netscape.com
d76e079ac5 #92806 need to support GB18030 in universal detector
replace state machine for gb18030 with the new one, which has been tested
in PSM detector.
r=ftang, sr=scc,
2005-11-02 16:57:04 +00:00
shanjian%netscape.com
2c8314b3ba #122081 Universalchardet leaks memory
The fix for the nsCharSetProber class declaration, plus fix for
nsUniversalDetector methods, which may cause another leak when the object is
reused after Reset().

Patch proposed by Takayuki Tamura
r=shanjian, sr/a=brendan
2005-11-02 16:57:03 +00:00
shanjian%netscape.com
81c3bb18d5 #109913 universal charset detector improvement: use positive approach
r=yokoyama, sr=brendan
2005-11-02 16:57:02 +00:00
shanjian%netscape.com
ea0fa010bb #119377 NS crashes if use universal Charset detector on in frame
Added some range verification when calculating order.
r=nhotta, sr=brendan
2005-11-02 16:57:01 +00:00
alecf%netscape.com
2bbc37ef06 argh, also patch universal char det
similar to patch by bzbarsky@mit.edu that I had in my tree
fixes extensions bustage
2005-11-02 16:57:00 +00:00
shanjian%netscape.com
6f4a05ad03 #180988 ���������������������������������������� in text causes universal AD to display in Big5
count every char for total.
yokoyama: review+
bzbarsky: superreview+
2005-11-02 16:56:59 +00:00
timeless%mozdev.org
48d5de6d5d Bug 207057 Division by zero in UCDET.DLL
r=smontagu sr=rbs
2005-11-02 16:56:58 +00:00
shanjian%netscape.com
147b982cbe #100377 Auto-detect All detects yahoo-japan page as a wrong charset that save by Composer
r=yokoyama, sr=brendan
2005-11-02 16:56:57 +00:00
dbaron%fas.harvard.edu
18f228cbc6 Remove more no-longer-needed nsComponentManager::RegisterComponent calls. Remove remnants of NS_IMPL_IDS macro, associated with long-gone NS_DECLARE_ID. b=125782 r=dougt rs=jag 2005-11-02 16:56:56 +00:00
sfraser%netscape.com
1317dc443e Making the nsModuleComponentInfo data const; bug 74803. r=dp, sr=waterson 2005-11-02 16:56:55 +00:00
gerv%gerv.net
d340e8701c License changes, take 2. Bug 98089. mozilla/extensions/. 2005-11-02 16:56:54 +00:00
gerv%gerv.net
cc72cf6220 Oops. 2005-11-02 16:56:53 +00:00
scc%mozilla.org
cb42f8b12b bug #98089: ripped new license 2005-11-02 16:56:52 +00:00
shanjian%netscape.com
ced27ead66 #33337 Universal Charset Autodetection for mozilla tree
The code is almost the same as in commercial tree, 3rd party detector is removed, and name has been changed by roy.

r=shanjian/ftang  sr=blizzard
2005-11-02 16:56:51 +00:00
shanjian%netscape.com
2b63999c92 #92806 need to support GB18030 in universal detector
r= ftang sr=blizzard@mozilla.org

This fix also added some comment for:
#33337 Universal Charset Autodetection for mozilla tree
The code is almost the same as in commercial tree, 3rd party detector is removed, and name has been changed by roy.

r=shanjian/ftang  sr=blizzard
2005-11-02 16:56:50 +00:00
shanjian%netscape.com
43a6df9a3c Refer bugscape 7195, same change here. 2005-11-02 16:56:49 +00:00
shanjian%netscape.com
8d4c624d03 #90581 Cyrillic is not autodetected by nsIStringCharsetDetector
r= roy, sr=waterson
2005-11-02 16:56:48 +00:00
dbaron%fas.harvard.edu
4da68806ec Replace uses of deprecated NS_IMPL_ISUPPORTS and NS_IMPL_QUERYINTERFACE macros with NS_IMPL_{ISUPPORTS,QUERYINTERFACE}{1,0}. r=jag rs=brendan b=45797 2005-11-02 16:56:47 +00:00
shanjian%netscape.com
a766f9f238 #92807 universal detector does not work with mail/news
r=ftang@netscape.com	 sr=blizzard@mozilla.org

This fix also added some comment for:
#33337 Universal Charset Autodetection for mozilla tree
The code is almost the same as in commercial tree, 3rd party detector is removed, and name has been changed by roy.

r=shanjian/ftang  sr=blizzard
2005-11-02 16:56:46 +00:00
shanjian%netscape.com
3b2a85a6d4 #92339 Universal Auto Chardetector doesn't work.
(This is yokoyama@netscape.com 's bug. )
r= shanjian  sr=blizzard@mozilla.org

his fix also added some comment for:
#33337 Universal Charset Autodetection for mozilla tree
The code is almost the same as in commercial tree, 3rd party detector is removed, and name has been changed by roy.

r=shanjian/ftang  sr=blizzard
2005-11-02 16:56:45 +00:00
yokoyama%netscape.com
4322faf504 Bug : 33337 - Universal Charset Autodetection
Adding universal charset files to extension folder.
/r=shanjian, /r=ftang
2005-11-02 16:56:44 +00:00
mozilla.mano%sent.com
a72e1fcea5 Bug 304951 - error in chardet's Hebrew language model. patch from Shy Shalom <shoosh20012001@hotmail.com>, r=smontagu, sr=roc. 2005-08-26 19:09:41 +00:00
timeless%mozdev.org
82a99c962e Bug 86999 - Hebrew support for Universal (All) Autodetect
fixing debug output typo
patch by shoosh20012001@hotmail.com r=smontagu sr=roc a=asa
2005-08-09 19:43:46 +00:00
timeless%mozdev.org
7612af32a7 Bug 86999 Hebrew support for Universal (All) Autodetect
patch by shoosh20012001@hotmail.com r=smontagu r=timeless sr=roc a=asa
HebrewProber written by shoosh20012001@hotmail.com
LangHebrewModel written by smontagu
some data compiled by xslf@xslf.com
2005-08-09 04:50:07 +00:00
bsmedberg%covad.net
be30b2f533 Adding MOZILLA_INTERNAL_API to internal makefiles, bug 288626, rs=darin 2005-04-04 18:38:19 +00:00
darin%meer.net
189ba5e5c4 eliminating uses of deprecated nsComponentManager:: methods (bug 267040), r=bsmedberg 2004-11-01 18:50:36 +00:00
jshin%mailaps.org
1db31547d6 separate xpcom wrapper and charset detectors : patch by shanjian with some updates by me (r=smontagu, sr=alecf) 2004-08-03 01:50:37 +00:00
timeless%mozdev.org
6395759145 Bug 228542 Divide by zero bug in Universal charset detecter
patch by vladimir@lukianov.name r=smontagu sr=roc
2004-06-24 11:53:10 +00:00
timeless%mozdev.org
420a446203 Bug 228545 Bug "FilterWith..." in universal charset detector
patch by vladimir@lukianov.name r=smontagu sr=roc
2004-06-24 11:49:08 +00:00
timeless%mozdev.org
cd6e4fe1b9 Bug 248307 lack of {} in mozilla\extensions\universalchardet\src\nsEUCJPProber.cpp:80
patch by vladimir@lukianov.name r=roc sr=roc
2004-06-23 19:54:02 +00:00
timeless%mozdev.org
22ce9378ec Bug 225994 DRefTool analysis for universalchardet files
r=smontagu sr=roc
2004-04-25 11:08:44 +00:00
gerv%gerv.net
86486431ae Bug 236613: change to MPL/LGPL/GPL tri-license. 2004-04-18 22:14:17 +00:00
tor%cs.brown.edu
bbf2ef596f Bug 208314 - remove use of iostreams to allow building with compilers
that don't support classic C++ iostreams (ex: MSVC++ .net 2003).
r=timeless, sr=dbaron
2004-02-07 02:41:10 +00:00
bsmedberg%covad.net
d0f309943a Continuing to land the PACKAGING_20030906_BRANCH for bug 20640. Not part of the build, yet. 2004-01-07 13:37:00 +00:00
bsmedberg%covad.net
274ef7cd49 Beginning to land the PACKAGING_20030906_BRANCH for bug 20640. Not part of the build, yet. 2004-01-07 01:22:31 +00:00
jgmyers%speakeasy.net
ce5088ba8f Accept halfwidth katakana in iso-2022-jp: bug 172393 r=ftang sr=roc 2004-01-03 21:45:21 +00:00