Metadata-Version: 2.0
Name: igo-python
Version: 0.9.9
Summary: Python port of Igo Japanese morphological analyzer
Home-page: https://github.com/hideaki-t/igo-python/
Author: Hideaki Takahashi
Author-email: mymelo@gmail.com
License: MIT
Keywords: japanese,morphological analyzer
Platform: UNKNOWN
Classifier: Development Status :: 4 - Beta
Classifier: Intended Audience :: Developers
Classifier: License :: OSI Approved :: MIT License
Classifier: Natural Language :: Japanese
Classifier: Operating System :: OS Independent
Classifier: Operating System :: Microsoft :: Windows
Classifier: Operating System :: POSIX :: Linux
Classifier: Programming Language :: Python :: 2.7
Classifier: Programming Language :: Python :: 3.3
Classifier: Programming Language :: Python :: 3.4
Classifier: Programming Language :: Python :: 3.5
Classifier: Programming Language :: Python :: 3.6
Classifier: Programming Language :: Python :: Implementation :: CPython
Classifier: Programming Language :: Python :: Implementation :: IronPython
Classifier: Programming Language :: Python :: Implementation :: PyPy
Classifier: Topic :: Scientific/Engineering :: Information Analysis
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Topic :: Text Processing :: Linguistic

================
 Igo for Python
================

About
=====

Igo_ is a Japanese morphological analyzer written in Java and Common Lisp.
This software is Python port of Igo(Java version).

.. _Igo: http://igo.osdn.jp/

Notice
======

Dictionary builder is not provided. You need to use Igo Java version to build the dictionary for Igo.
>From igo-python 0.9.7, pre-built `IPA dictionary (2.7.0-20070801)`__ is included for ease of use.

__ https://drive.google.com/uc?export=download&id=0B4y35FiV1wh7MWVlSDBCSXZMTXM

How To Use
==========

You can use Igo Python easily::

 >>> from igo.Tagger import Tagger
 >>> t = Tagger() # use bundled dictionary
 >>> for m in t.parse(u'すもももももももものうち'):
 ...     print m.surface, m.feature
 ...
 すもも 名詞,一般,*,*,*,*,すもも,スモモ,スモモ
 も 助詞,係助詞,*,*,*,*,も,モ,モ
 もも 名詞,一般,*,*,*,*,もも,モモ,モモ
 も 助詞,係助詞,*,*,*,*,も,モ,モ
 もも 名詞,一般,*,*,*,*,もも,モモ,モモ
 の 助詞,連体化,*,*,*,*,の,ノ,ノ
 うち 名詞,非自立,副詞可能,*,*,*,うち,ウチ,ウチ
 >>>


Using command line parser::

  $ igo # or python -m igo.parse
  私の名前は中野です
  私      名詞,代名詞,一般,*,*,*,私,ワタシ,ワタシ
  の      助詞,連体化,*,*,*,*,の,ノ,ノ
  名前    名詞,一般,*,*,*,*,名前,ナマエ,ナマエ
  は      助詞,係助詞,*,*,*,*,は,ハ,ワ
  中野    名詞,固有名詞,地域,一般,*,*,中野,ナカノ,ナカノ
  です    助動詞,*,*,*,特殊・デス,基本形,です,デス,デス
  EOS


Changelog for Igo-Python
========================
2017-02-09 -- 0.9.9
    * fixed few small issues and add unit test.

2016-07-08 -- 0.9.8
    * fixed a surrogate pair character handling issue

2015-12-16 -- 0.9.7.1
    * Tagger can be pickled on Python2 env

2015-12-14 -- 0.9.7
    * include pre-built IPA dictionary

2015-07-03 -- 0.9.6
    * fixed an issue of dictionary size in mmap mode

2015-04-14 -- 0.9.5
    * use mmap to load files of a dictionary if it is possible

2013-07-05 -- 0.9.3
    * support characters in Supplementary Planes

2012-02-27 -- 0.9.2
    * apply changes in Igo(java) 0.4.4

2012-02-27 -- 0.9.1
    * synchronize with Igo(java) 0.4.5

2011-09-01 -- 0.9
    * compatibility with python3.2
    * update the code base to igo-0.4.3
    * add some enhances and clean up.

2011-08-29 -- 0.4
    * add some enhances and clean up.

2010-11-28 -- 0.3a
    * fix bugs(if a phrase ends with 'た' will causes error)

2010-11-27 -- 0.3
    * drop mmap related code
    * reduce memory footprint

2010-11-27 -- 0.2
    * support Google App Engine(maybe)

2010-11-27 -- 0.1a
    * update package info only.

2010-11-25 -- 0.1
    * first release.


