39 Commits

Author SHA1 Message Date
Theppitak Karoonboonyanan 6702fb2b9f Thai: Add hyphenation exceptions.
* thai-exc.pat:
  + Add forcing exceptions for:
    - ปโต~เล-มี
    - พรม-คด
    - อี-สป
    - เก~ร-ละ
    - เทส-ลา
    - เอส~เพรส-โซ
    - โซ-ฟี
  + Add blocking exceptions for:
    - ปฏิ~ป-ทา
2018-08-03 15:02:19 +07:00
Theppitak Karoonboonyanan 0143d9abea Thai: Update dict from libthai 0.1.28
* tdict-city.txt:
* tdict-common.txt:
* tdict-country.txt:
* tdict-district.txt:
* tdict-geo.txt:
* tdict-history.txt:
* tdict-ict.txt:
* tdict-lang-ethnic.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-slang.txt:
* tdict-spell.txt:
* tdict-std.txt:
  - Update dict from libthai 0.1.28.

* thai-exc.pat:
  - Drop 'า7มี' exception, duplicating generated 'า3มี'.
  - Drop 'โค7ลี' exception, duplicating 'โค5ลี'.
  - Drop '<SARA-II>7รี' exception, duplicating '<SARA-II>3รี'.
2018-08-03 15:01:45 +07:00
Theppitak Karoonboonyanan 626ac79748 Thai: Move slang dict entries to a new file.
* Makefile, +tdict-slang.txt, tdict-common.txt:
  - Manually move hyphenated slang entries from tdict-common.txt
    to tdict-slang.txt, according to change in libthai dict
    categories.
2018-08-03 15:00:20 +07:00
Theppitak Karoonboonyanan ffdf2d5549 Reformat header as YAML.
* hyph-th.tex.in:
  - Reformat the template file header as YAML.
2017-11-02 11:08:14 +07:00
Theppitak Karoonboonyanan 4c79390847 Add hyphenation exceptions.
* thai-exc.pat:
  + Add forcing exceptions for:
    - ทัก~ษ-อร
    - เด-ชะ
    - โก~โบ-ริ
  + Add blocking exceptions for:
    - นิร~ช-รา, เน~รัญ~ช-รา
2017-10-30 15:36:17 +07:00
Theppitak Karoonboonyanan 527d283dea Update dict from libthai 0.1.27
* tdict-city.txt:
* tdict-common.txt:
* tdict-district.txt:
* tdict-geo.txt:
* tdict-history.txt:
* tdict-ict.txt:
* tdict-lang-ethnic.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
  - Update dict from libthai 0.1.27.

* thai-exc.pat:
  - Drop 'ฮ7โ' exception, which duplicated the generated 'ฮ1โ'.
2017-10-30 15:34:49 +07:00
Theppitak Karoonboonyanan 3234c7bf68 Update URLs in files
* README, hyph-th.tex.in:
  - Use HTTPS for LTN project pages

* hyph-th.tex.in:
  - Replace obsolete LTN SVN URL with that of GitHub
2017-10-30 15:20:06 +07:00
Theppitak Karoonboonyanan c7aa4cf03a Update dict from libthai 0.1.26
* tdict-city.txt:
* tdict-common.txt:
* tdict-geo.txt:
* tdict-ict.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
* tdict-std-compound.txt:
* tdict-std.txt:
  - Update dict from libthai 0.1.26.

Make hyphenation of "*สเตอร์*" portions consistent.

* tdict-city.txt:
  - Correct hyphenation for 'อัมสเตอร์ดัม'.
* tdict-common.txt:
  - Correct hyphenation for 'ฮิปสเตอร์'.
2017-10-30 15:15:04 +07:00
Theppitak Karoonboonyanan ed0ae3db25 Update dict from libthai 0.1.25.
* tdict-city.txt:
* tdict-common.txt:
* tdict-country.txt:
* tdict-district.txt:
* tdict-ict.txt:
* tdict-lang-ethnic.txt:
* tdict-proper.txt:
* tdict-science.txt:
* tdict-spell.txt:
  - Update dict from libthai 0.1.25.

* tdict-std.txt:
  - Correct hyphenation for 'สาธารณสมบัติ'.
2017-10-30 15:11:47 +07:00
Theppitak Karoonboonyanan eeb973c441 Fix hyphenation dict tools.
* diff-dicts.sh, test-hyphen.sh:
  - Convert pdflatex output to UTF-8 before applying grep,
    as newer grep seems to treat TIS-620 string as binary
    and output error message instead.
2017-10-30 15:05:29 +07:00
Mojca Miklavec 16beb96a20 update Indic patterns from github, licence changed to MIT 2016-01-23 08:58:11 +01:00
Theppitak Karoonboonyanan af04e31920 Update Thai hyphenation dictionary. 2015-05-07 19:48:29 +07:00
Theppitak Karoonboonyanan 1848cdad47 Update Thai hyphenation dictionary. 2014-08-22 12:51:25 +07:00
Mojca Miklavec 2cb4407004 git repository for indic patterns 2014-06-04 15:57:07 +02:00
Theppitak Karoonboonyanan 4f926eace3 Add source for Thai hyphenation dictionary. 2013-10-29 10:29:48 +07:00
Mojca Miklavec 8a8c3c0b8f reintroduce Georgian patterns
- add LPPL licence
- a new version 0.3 with lefthyphenmin=1, righthyphenmin=2
- propdel svn:binary from hyph-ka.chr.tx
2013-06-04 18:03:52 +02:00
Mojca Miklavec d20552eba0 remove Georgian patterns (waiting for licence) 2013-05-26 21:28:06 +02:00
Mojca Miklavec b60bbb47b7 remove '11' from Georgian patterns 2013-03-01 18:00:04 +01:00
Mojca Miklavec a4e52da076 update Georgian wordlist and patterns 2013-03-01 17:31:38 +01:00
Mojca Miklavec fff2b2f7ad Add the wordlist used for generating Georgian hyphenation patterns plus translation table UTF-8 -> T8M 2013-02-25 06:14:20 +01:00
Mojca Miklavec 6335d49c53 location of indic patterns has changed 2011-06-07 11:51:44 +02:00
Mojca Miklavec 62d70ff4b9 URL for indic patterns has changed 2010-09-20 18:14:18 +02:00
Mojca Miklavec cba917c5f8 add Afrikaans patterns 2010-09-18 02:35:37 +02:00
Mojca Miklavec 7b2646cf71 a preliminary script for lao (might never be used at all) 2010-05-31 20:28:45 +02:00
Mojca Miklavec 04328dd200 a naive attempt to merge changes from branch/luatex into trunk (svn merge -r 341:415 svn://mojca@tug.org/texhyphen/branches/luatex) 2010-05-31 18:41:04 +02:00
Mojca Miklavec 6f9e62d32a add mongolian testsuite (compares different pattern versions) 2010-03-19 21:01:27 +01:00
Mojca Miklavec 0f15a78c53 fix tiny problems with indic patterns that made patterns fail to load in TeX (that should better be fixed upstream) 2010-03-19 10:40:26 +01:00
Mojca Miklavec f8a0485746 add pattern loaders for indic script (check for hyphenmin) 2010-03-17 18:38:41 +01:00
Mojca Miklavec 708f0d9538 add patterns by Santhosh Thottingal (+ script to fetch and convert them) 2010-03-17 18:23:37 +01:00
Mojca Miklavec c33cbb8b98 Add a list of hyphenated words to repository; there are zero-width-spaces present that need to be removed. 2010-03-17 09:26:20 +01:00
Mojca Miklavec 58dacb6b70 removed sources for basque from here (they are under sources) 2008-06-25 17:18:02 +02:00
Mojca Miklavec 3d738791d6 removed sources for galician from here 2008-06-25 17:17:24 +02:00
Mojca Miklavec 7851e1367d pattern file renamed 2008-06-23 15:57:20 +02:00
Mojca Miklavec 6ff837ca8c add sources for galician hyphenation patterns 2008-06-23 15:53:59 +02:00
Mojca Miklavec ee2d1918f1 source for generating galician added 2008-06-19 14:10:55 +02:00
Mojca Miklavec fd01468693 some executables removed 2008-06-10 03:01:04 +02:00
Arthur Reutenauer f2ae912857 * Modified pattern loading mechanism.
* Added patterns and pattern loaderns for Greeks
* Moved source/tk to source/tr (tk is Turkmen!)
2008-06-09 00:07:29 +02:00
Mojca Miklavec cbaa69172b auto-generator for basque 2008-06-08 16:17:41 +02:00
Mojca Miklavec 0d74ce3634 paper about turkish hyphenations 2008-06-03 12:56:56 +02:00