<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-3521173402786945815</id><updated>2011-04-22T03:20:17.730+07:00</updated><category term='word alignment'/><category term='wiki'/><category term='part-of-speech'/><category term='orchid corpus'/><category term='word aligner'/><category term='GNU/Linux'/><category term='multilingual text'/><category term='howto'/><category term='tutorial'/><category term='Vee&apos;s blog'/><category term='Simplify'/><category term='map'/><category term='SVG'/><category term='XML'/><category term='format'/><category term='donation'/><category term='HFS+'/><category term='Ternary search tree'/><category term='corpus'/><category term='ATSUI'/><category term='www.vee-u.com'/><category term='Firefox'/><category term='dos2unix ubuntu debian'/><category term='Guide'/><category term='Ruby'/><category term='Mac OS X'/><category term='cakephp'/><category term='NetBSD'/><category term='thai'/><category term='Alignment'/><category term='Blog'/><category term='GIZA++'/><category term='encyclopedia'/><category term='problem'/><category term='patch'/><category term='manual'/><category term='Vee Satayamas'/><title type='text'>Vee - GNU/Linux</title><subtitle type='html'>This blog is "Vee - GNU/Linux" for feeding into LTN Planet since April 4th, 2007. 
Vee 's general blog moved to &lt;a href="http://blog.vee-u.com"&gt;blog.vee-u.com&lt;/a&gt;.</subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>12</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-2523107381000947358</id><published>2007-07-20T02:41:00.000+07:00</published><updated>2007-07-28T04:08:49.789+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='thai'/><category scheme='http://www.blogger.com/atom/ns#' term='ATSUI'/><category scheme='http://www.blogger.com/atom/ns#' term='patch'/><category scheme='http://www.blogger.com/atom/ns#' term='Firefox'/><title type='text'>รวมการเฉพาะกิจ เพื่อ patch Firefox</title><content type='html'>We are trying to make patch for Firefox to call native line breaking API on Mac OS X and Windows. If you are interested in this, please join &lt;a href="http://scratchpad.wikia.com/wiki/Firefox_Thai"&gt;http://scratchpad.wikia.com/wiki/Firefox_Thai&lt;/a&gt; (It is in Thai language).
&lt;br/&gt;&lt;br/&gt;
[Translated from Thai]
Not long after I posted to this blog (really not long: as I write this post, Xcode still has not finished downloading), เก่ง.ws contacted me to say that he also wants to connect Firefox to ATSUI. So &lt;a href="www.keng.ws"&gt;เก่ง.ws&lt;/a&gt; and I created a wiki page together so that we can share information, including questions and answers, and so that anyone else who is interested can easily take part. The page is at &lt;a href="http://scratchpad.wikia.com/wiki/Firefox_Thai"&gt;http://scratchpad.wikia.com/wiki/Firefox_Thai&lt;/a&gt;. I hope that sharing will let us finish the work without any of us getting too tired or too lonely (having a problem and not knowing whom to ask ... and then asking yourself what you are doing sitting here all alone).&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-2523107381000947358?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/2523107381000947358/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=2523107381000947358' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2523107381000947358'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2523107381000947358'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/07/patch-firefox.html' title='รวมการเฉพาะกิจ เพื่อ patch Firefox'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' 
src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-1108206310123028462</id><published>2007-07-19T22:46:00.000+07:00</published><updated>2007-07-19T22:59:14.091+07:00</updated><title type='text'>Firefox 3 + ATSUI</title><content type='html'>[Tinglish]
Since reading Thep's blog, I have had hope for accurate Thai line breaking in Firefox on Mac OS X. As far as I understand from that blog, we can improve Thai line breaking in Firefox on Mac OS X just by connecting ATSUI to Firefox. Still, I don't think I can do this work by myself. Is anyone already working on this? If so, I would like to join you. Even if you don't want me to join, please tell me so that I can avoid duplicating the work. My first goal is to build Firefox on Mac OS X, so I am now reading &lt;a href="http://developer.mozilla.org/en/docs/Mac_OS_X_Build_Prerequisites"&gt;http://developer.mozilla.org/en/docs/Mac_OS_X_Build_Prerequisites&lt;/a&gt;. I will also upgrade my Xcode. &lt;br/&gt;&lt;br/&gt;
[Translated from Thai]
After reading Thep's blog, I became hopeful about Thai word breaking in Firefox on Mac OS X. From what I have read, connecting ATSUI to Firefox should be enough to make it work. But I probably don't have the skill to finish it myself. If someone is already working on this and would like my help, please let me know. If you would rather do it alone, you can tell me that too; then I won't have to do it, which is even easier for me. For now, the first thing I am doing is trying to build Firefox on Mac OS X, so I have to read &lt;a href="http://developer.mozilla.org/en/docs/Mac_OS_X_Build_Prerequisites"&gt;http://developer.mozilla.org/en/docs/Mac_OS_X_Build_Prerequisites&lt;/a&gt;. I also plan to install a new Xcode.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-1108206310123028462?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/1108206310123028462/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=1108206310123028462' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/1108206310123028462'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/1108206310123028462'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/07/firefox-3-atsui.html' title='Firefox 3 + ATSUI'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-6896677655253554664</id><published>2007-05-24T17:12:00.000+07:00</published><updated>2007-05-24T17:16:59.243+07:00</updated><category 
scheme='http://www.blogger.com/atom/ns#' term='cakephp'/><category scheme='http://www.blogger.com/atom/ns#' term='donation'/><title type='text'>CakePHP: Donation</title><content type='html'>Yesterday, I donated 5 USD to the Cake Software Foundation. 

5 USD = 5 meals (for me, in Thailand). 

So it is a lot of money :-P&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-6896677655253554664?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/6896677655253554664/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=6896677655253554664' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/6896677655253554664'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/6896677655253554664'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/05/cakephp-donation.html' title='CakePHP: Donation'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-3690511256570744012</id><published>2007-04-17T18:04:00.000+07:00</published><updated>2007-04-17T18:08:47.648+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='www.vee-u.com'/><category scheme='http://www.blogger.com/atom/ns#' term='Vee Satayamas'/><category scheme='http://www.blogger.com/atom/ns#' term='Vee&apos;s blog'/><category scheme='http://www.blogger.com/atom/ns#' term='GNU/Linux'/><category scheme='http://www.blogger.com/atom/ns#' term='Blog'/><title type='text'>For GNU/Linux only</title><content type='html'>I found that I had posted a lot of articles about human language technology and other topics here. 
So I created a new blog (and homepage) at &lt;a href="http://www.vee-u.com"&gt;www.vee-u.com&lt;/a&gt;, and I will try to post mostly GNU/Linux and free-software-related material here.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-3690511256570744012?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/3690511256570744012/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=3690511256570744012' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/3690511256570744012'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/3690511256570744012'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/04/for-gnulinux-only.html' title='For GNU/Linux only'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-5692657847814150906</id><published>2007-03-31T16:40:00.000+07:00</published><updated>2007-04-11T17:32:53.232+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='orchid corpus'/><category scheme='http://www.blogger.com/atom/ns#' term='corpus'/><category scheme='http://www.blogger.com/atom/ns#' term='thai'/><category scheme='http://www.blogger.com/atom/ns#' term='format'/><category scheme='http://www.blogger.com/atom/ns#' term='part-of-speech'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><title 
type='text'>Converting Orchid corpus to XML</title><content type='html'>The Orchid corpus is a Thai part-of-speech annotated corpus that used to be freely available on Nectec's website. (I hope it becomes available again.) Its format is quite unusual, which makes it inconvenient to handle, so I wrote a &lt;a href="http://www.vee-u.com/src/orchid_to_xml.rb"&gt;script&lt;/a&gt; to convert it to XML. Now I can process it with an XML parser such as pulldom, through a familiar API such as (pull)DOM.

An example of the Orchid corpus format:
%metadata
%metadata
#P1
#1
blaa blaa blaa//
blaa/NNNN
blaa/NNNN
blaa/NNNN
//

An example of the XML produced from the Orchid corpus format:
&amp;lt;corpus&amp;gt;
 &amp;lt;document author="abcd" ...&amp;gt;
     &amp;lt;paragraph&amp;gt;
         &amp;lt;sentence raw_txt="blaa blaa blaa"&amp;gt;
             &amp;lt;word surface="blaa" pos="NNNN"/&amp;gt;
             &amp;lt;word surface="blaa" pos="NNNN"/&amp;gt;
             &amp;lt;word surface="blaa" pos="NNNN"/&amp;gt;
         &amp;lt;/sentence&amp;gt;
     &amp;lt;/paragraph&amp;gt;
 &amp;lt;/document&amp;gt;
...
&amp;lt;/corpus&amp;gt;
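
The conversion can be sketched in a few lines. This is only a rough Python illustration, not the Ruby script linked above, and it assumes the simplified format shown in the example: lines starting with % are metadata, #P starts a paragraph, # followed by a number starts a sentence, a line ending in // inside a sentence is the raw text, word/TAG lines are tokens, and a bare // ends the sentence. The real corpus has more cases than this. The function name orchid_to_xml is made up for this sketch.

```python
import xml.etree.ElementTree as ET


def orchid_to_xml(lines):
    """Convert lines in the simplified Orchid-like format to an XML element."""
    corpus = ET.Element("corpus")
    # Assumption: one document per input; metadata lines are skipped here.
    document = ET.SubElement(corpus, "document")
    paragraph = None
    sentence = None
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # blank line or metadata
        if line.startswith("#P"):
            paragraph = ET.SubElement(document, "paragraph")
            sentence = None
        elif line.startswith("#"):
            if paragraph is None:
                paragraph = ET.SubElement(document, "paragraph")
            sentence = ET.SubElement(paragraph, "sentence")
        elif line == "//":
            sentence = None  # end of the current sentence
        elif sentence is None:
            continue  # stray line outside any sentence
        elif line.endswith("//"):
            sentence.set("raw_txt", line[:-2])
        else:
            # Split on the last slash: surface form, then the POS tag.
            surface, _, pos = line.rpartition("/")
            ET.SubElement(sentence, "word", surface=surface, pos=pos)
    return corpus


sample = """%metadata
#P1
#1
blaa blaa blaa//
blaa/NNNN
blaa/NNNN
blaa/NNNN
//
"""
root = orchid_to_xml(sample.splitlines())
print(ET.tostring(root, encoding="unicode"))
```

The result can then be read back with any XML parser, for example xml.dom.pulldom from the Python standard library.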

The TEI format is probably well suited to this job, but I am just too lazy to read the specification.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-5692657847814150906?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/5692657847814150906/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=5692657847814150906' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/5692657847814150906'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/5692657847814150906'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/03/converting-orchid-corpus-to-xml.html' title='Converting Orchid corpus to XML'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-2001170246382740414</id><published>2007-03-28T02:17:00.000+07:00</published><updated>2007-03-28T02:24:45.684+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='multilingual text'/><category scheme='http://www.blogger.com/atom/ns#' term='SVG'/><category scheme='http://www.blogger.com/atom/ns#' term='problem'/><category scheme='http://www.blogger.com/atom/ns#' term='Mac OS X'/><category scheme='http://www.blogger.com/atom/ns#' term='Firefox'/><title type='text'>Displaying multilingual text in SVG using Firefox</title><content type='html'>In Khem's tree editor, SVG is used for 
displaying trees in Firefox. Firefox 2.x on Windows XP displays English and Thai text in SVG correctly. But when I use Firefox 2.x on Mac OS X, Thai, Bengali and Chinese text is rendered as boxes, as shown below.

&lt;img src="http://farm1.static.flickr.com/187/436649841_ee4af7beb7_o.png" alt="firefox screenshot"/&gt;
&lt;div style="margin-left: 40px;"&gt;
 (using the following code)
 &amp;lt;svg xmlns="http://www.w3.org/2000/svg"
      xmlns:xlink="http://www.w3.org/1999/xlink"
      version="1.1"
      baseProfile="full"&amp;gt;
  &amp;lt;text x="50" y="50"
         font-size="16" fill="blue" &amp;gt;
     Wikipedia 維基百科 วิกิพีเดีย উইকিপিডিয়া
   &amp;lt;/text&amp;gt;  
 &amp;lt;/svg&amp;gt;

&lt;/div&gt;So I tried assigning a font family to the text, as in the following code:

&lt;div style="margin-left: 40px;"&gt;  &amp;lt;svg xmlns="http://www.w3.org/2000/svg"
      xmlns:xlink="http://www.w3.org/1999/xlink"
      version="1.1"
      baseProfile="full"&amp;gt;
  &amp;lt;text x="50" y="50"      
 &lt;span style="font-weight: bold;"&gt;font-family="Garuda"&lt;/span&gt; font-size="16"
 fill="blue" &amp;gt;
     Wikipedia 維基百科 วิกิพีเดีย উইকিপিডিয়া
   &amp;lt;/text&amp;gt;  
 &amp;lt;/svg&amp;gt;

&lt;/div&gt;It works: Firefox now displays the Thai text correctly. However, it still cannot display the Bengali and Chinese text, as shown below.

&lt;img src="http://farm1.static.flickr.com/178/436649855_6ea125722b_o.png" alt="firefox screenshot"/&gt;

I also tried other font families (Times, Sans and Helvetica), but with them only the English text is displayed.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-2001170246382740414?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/2001170246382740414/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=2001170246382740414' title='5 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2001170246382740414'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2001170246382740414'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/03/displaying-multilingual-text-in-svg.html' title='Displaying multilingual text in SVG using Firefox'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>5</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-8437124135502435441</id><published>2007-02-25T23:04:00.000+07:00</published><updated>2007-07-28T04:08:10.898+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Ternary search tree'/><category scheme='http://www.blogger.com/atom/ns#' term='Ruby'/><title type='text'>A pure ruby ternary search tree implementation</title><content type='html'>&lt;a href="http://www.geocities.com/veetai/tst_rb.txt"&gt;source code&lt;/a&gt;. It takes 10 minutes to load the Yaitron dictionary. 
So I am trying &lt;a href="http://kolchak.sdf-eu.org/res/ctst-README.html"&gt;ctst&lt;/a&gt; instead :-P. Thanks to lindever for introducing me to TST :-)&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-8437124135502435441?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/8437124135502435441/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=8437124135502435441' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/8437124135502435441'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/8437124135502435441'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/02/pure-ruby-ternary-search-tree.html' title='A pure ruby ternary search tree implementation'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-7639347650403341113</id><published>2007-01-05T16:48:00.000+07:00</published><updated>2007-01-05T16:52:35.665+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='map'/><category scheme='http://www.blogger.com/atom/ns#' term='wiki'/><category scheme='http://www.blogger.com/atom/ns#' term='encyclopedia'/><title type='text'>wiki + encyclopedia + map</title><content type='html'>&lt;a 
href="http://wikimapia.org/#y=13848455&amp;x=100575371&amp;z=15&amp;l=0&amp;m=a&amp;v=2"&gt;http://wikimapia.org/#y=13848455&amp;x=100575371&amp;z=15&amp;l=0&amp;m=a&amp;v=2&lt;/a&gt;
&lt;br/&gt;
Via: ans&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-7639347650403341113?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/7639347650403341113/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=7639347650403341113' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/7639347650403341113'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/7639347650403341113'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/01/wiki-encyclopedia-map.html' title='wiki + encyclopedia + map'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-2955236792716948349</id><published>2007-01-05T14:15:00.000+07:00</published><updated>2007-01-05T14:17:58.163+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='dos2unix ubuntu debian'/><title type='text'>Where is dos2unix (on Ubuntu)?</title><content type='html'>&lt;a href="http://packages.debian.org/unstable/utils/tofrodos"&gt;http://packages.debian.org/unstable/utils/tofrodos&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-2955236792716948349?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' 
href='http://vee-r.blogspot.com/feeds/2955236792716948349/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=2955236792716948349' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2955236792716948349'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/2955236792716948349'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2007/01/where-is-dos2unix-on-ubuntu.html' title='Where is dos2unix (on Ubuntu)?'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-528373365884141730</id><published>2006-12-25T18:11:00.004+07:00</published><updated>2008-03-04T12:01:03.228+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='word alignment'/><category scheme='http://www.blogger.com/atom/ns#' term='word aligner'/><category scheme='http://www.blogger.com/atom/ns#' term='howto'/><category scheme='http://www.blogger.com/atom/ns#' term='Alignment'/><category scheme='http://www.blogger.com/atom/ns#' term='GIZA++'/><category scheme='http://www.blogger.com/atom/ns#' term='tutorial'/><category scheme='http://www.blogger.com/atom/ns#' term='manual'/><category scheme='http://www.blogger.com/atom/ns#' term='Guide'/><category scheme='http://www.blogger.com/atom/ns#' term='Simplify'/><title type='text'>GIZA++ Guide</title><content type='html'>A newer and easier guide for Ubuntu/Debian users is available at &lt;a 
href="http://blog.vee-u.com/2008/03/02/giza_pp/"&gt;http://blog.vee-u.com/2008/03/02/giza_pp/&lt;/a&gt; 

&lt;ul&gt;
&lt;li&gt;First, prepare two text files that are aligned line by line, so that line n of one file is the translation of line n of the other. For example,&lt;br/&gt;
&lt;br/&gt;
tha.txt:&lt;br/&gt;
ฉัน กิน ข้าว&lt;br/&gt;
ฉัน ไป โรงเรียน&lt;br/&gt;
&lt;br/&gt;
eng.txt:&lt;br/&gt;
I eat rice.&lt;br/&gt;
I go to school.&lt;br/&gt;
&lt;/li&gt;

&lt;li&gt;Second, generate the vocabulary files and the sentence-correspondence file with plain2snt.out. For example, run "plain2snt.out eng.txt tha.txt". It should generate eng_tha.snt, eng.vcb and tha.vcb.
&lt;/li&gt;

&lt;li&gt;Third, write a configuration file. For example,&lt;br/&gt;
&lt;br/&gt;
config:&lt;br/&gt;
outputfileprefix play_giza&lt;br/&gt;
sourcevocabularyfile eng.vcb&lt;br/&gt;
targetvocabularyfile tha.vcb&lt;br/&gt;
c eng_tha.snt&lt;br/&gt;

&lt;/li&gt;
&lt;li&gt;Finally, run GIZA++ with the command "GIZA++ config". The final result should then be in the file play_giza.A3.final. (Be careful on Mac OS X: on a case-insensitive file system such as HFS+, play_giza.A3.final and play_giza.a3.final are the same file.)
&lt;/li&gt;
&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-528373365884141730?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/528373365884141730/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=528373365884141730' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/528373365884141730'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/528373365884141730'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2006/12/giza-guide.html' title='GIZA++ Guide'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-8452978715549722711</id><published>2006-12-25T02:13:00.000+07:00</published><updated>2006-12-25T02:27:46.298+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='Alignment'/><category scheme='http://www.blogger.com/atom/ns#' term='GIZA++'/><category scheme='http://www.blogger.com/atom/ns#' term='XML'/><title type='text'>GIZA++: XML output</title><content type='html'>An alignment output from GIZA++ is in special format. It looks nice and readable but I just don't want to write a parser. Hence I modified GIZA++ to output XML instead. 
&lt;a href="http://www.geocities.com/veetai/giza_report_xml.patch.gz"&gt;[Download the patch]&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-8452978715549722711?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/8452978715549722711/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=8452978715549722711' title='3 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/8452978715549722711'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/8452978715549722711'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2006/12/giza-xml-output.html' title='GIZA++: XML output'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>3</thr:total></entry><entry><id>tag:blogger.com,1999:blog-3521173402786945815.post-5136841942519626053</id><published>2006-12-24T01:26:00.001+07:00</published><updated>2008-06-16T22:24:36.086+07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='NetBSD'/><category scheme='http://www.blogger.com/atom/ns#' term='HFS+'/><category scheme='http://www.blogger.com/atom/ns#' term='GIZA++'/><category scheme='http://www.blogger.com/atom/ns#' term='Mac OS X'/><title type='text'>GIZA++ on Mac OS X (HFS+)</title><content type='html'>Today I found that foobar.a3.final and foobar.A3.final are the same file on HFS+ (the file system used on my iBook), because HFS+ is case-insensitive by default. 
Now I know why foobar.A3.final in my working directory does not match what is described in &lt;a href="http://www.fjoch.com/GIZA++.html"&gt;GIZA++&lt;/a&gt;'s README. A workaround is as follows:
&lt;pre&gt;
diff -Nuar GIZA++-v2/model3.cc GIZA++-v2-osx/model3.cc
--- GIZA++-v2/model3.cc Tue Sep 30 21:24:18 2003
+++ GIZA++-v2-osx/model3.cc     Sat Dec 23 18:16:08 2006
@@ -318,8 +318,8 @@
     d4file = Prefix + ".d4." + number ;
     d4file2 = Prefix + ".D4." + number ;
     d5file = Prefix + ".d5." + number ;
-      alignfile = Prefix + ".A3." + number ;
-      test_alignfile = Prefix + ".tst.A3." + number ;
+      alignfile = Prefix + ".uA3." + number ;
+      test_alignfile = Prefix + ".tst.uA3." + number ;
     p0file = Prefix + ".p0_3." + number ;
   }
   // clear count tables
&lt;/pre&gt;I noticed this after running GIZA++ on &lt;a href="http://www.netbsd.org/"&gt;NetBSD&lt;/a&gt;, where the result was just as described in the README.
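
One way to check whether a directory has this problem before running GIZA++ is to create a file under one name and then look for it under the other. The following is only a small Python sketch of that idea; the function name is_case_insensitive and the probe file names are invented for illustration and are not part of GIZA++.

```python
import os
import tempfile


def is_case_insensitive(directory):
    """Return True if names differing only in case refer to the same file."""
    # The probe files live in a throwaway subdirectory that is removed afterwards.
    with tempfile.TemporaryDirectory(dir=directory) as tmp:
        lower = os.path.join(tmp, "probe.a3.final")
        upper = os.path.join(tmp, "probe.A3.final")
        with open(lower, "w") as handle:
            handle.write("probe")
        # On a case-insensitive file system the upper-case name already exists.
        return os.path.exists(upper)


print(is_case_insensitive(tempfile.gettempdir()))
```

If this prints True for the working directory, output names such as foobar.a3.final and foobar.A3.final will collide there, and a patch like the one above (or a different file system) is needed.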

Update: Now I switched from Mac OS X to Ubuntu &lt;a href="http://blog.vee-u.com/2008/03/02/giza_pp/"&gt;http://blog.vee-u.com/2008/03/02/giza_pp/&lt;/a&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/3521173402786945815-5136841942519626053?l=vee-r.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://vee-r.blogspot.com/feeds/5136841942519626053/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=3521173402786945815&amp;postID=5136841942519626053' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/5136841942519626053'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/3521173402786945815/posts/default/5136841942519626053'/><link rel='alternate' type='text/html' href='http://vee-r.blogspot.com/2006/12/giza-on-mac-os-x-hfs.html' title='GIZA++ on Mac OS X (HFS+)'/><author><name>veer</name><uri>http://www.blogger.com/profile/06771165466118347444</uri><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:total>2</thr:total></entry></feed>
