kanjistat 0.13.1 (2024-05-17)
Bug fixes
- compare_neighborhoods gave obscure errors when stroke edit distances involved kanji with index > 2133. Fixed by returning an explicit error if the key kanji has such an index and by setting the corresponding return value to NA if any of the closest kanji in the kanji distance has such an index.
kanjistat 0.13.0 (2024-05-10)
Breaking changes
- Function kanjidist with approx = "pc" or approx = "pcweighted" now runs only for kanjivec objects generated with kanjistat 0.13.0 or newer.
New features
- The structure of kanjivec objects has been extended. Each stroke in the stroketree component now has an additional attribute "beziermat", which describes the Bézier curves of the stroke in a standardized 2 x (1+3n) matrix format (n = number of curves). The new structure is fully backward compatible. Whether a given kanjivec object kan follows the new structure can be tested by attr(kan, "kanjistat_version") >= 0.13.0. The kvecjoyo dataset on https://github.com/dschuhmacher/kanjistat.data has been updated accordingly.
- New function compare_neighborhoods, which currently compares stroke edit distances and kanji distances in a dstrokedit neighborhood of a given kanji and optionally extends the comparison to nearest neighbors in the kanji distance. This function is still somewhat experimental.
- kanjidist and kanjidistmat have a new parameter minor_warnings, which toggles warnings that most users can ignore. These warnings usually point to issues in the underlying kanjivec data or the kanjidist computation that are currently addressed by workarounds.
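The version test and the 2 x (1+3n) matrix layout described above can be sketched in base R. This is only an illustration under stated assumptions: the helper names has_new_structure and n_bezier_curves are hypothetical and not part of kanjistat, the mock object stands in for a real kanjivec object, and it is assumed that objects from older versions may lack the "kanjistat_version" attribute altogether.

```r
# Hypothetical helpers for illustration; they are not part of kanjistat.

# TRUE if `kan` carries a "kanjistat_version" attribute of at least 0.13.0.
# as.character() lets this work whether the attribute is stored as a string
# or as a package_version object (which of the two is used is an assumption).
has_new_structure <- function(kan) {
  v <- attr(kan, "kanjistat_version")
  !is.null(v) && package_version(as.character(v)) >= "0.13.0"
}

# Number of Bezier curves n encoded in a 2 x (1 + 3n) "beziermat" matrix.
n_bezier_curves <- function(beziermat) {
  stopifnot(is.matrix(beziermat),
            nrow(beziermat) == 2,
            (ncol(beziermat) - 1) %% 3 == 0)
  (ncol(beziermat) - 1) %/% 3
}

# Mock object standing in for a real kanjivec object (assumption).
kan_mock <- structure(list(), kanjistat_version = "0.13.0")
has_new_structure(kan_mock)

# A 2 x 7 matrix has 1 + 3 * 2 columns, so it encodes n = 2 curves.
n_bezier_curves(matrix(0, nrow = 2, ncol = 7))
```

Comparing through package_version avoids the pitfalls of plain string or numeric comparison, for example when a later version such as 0.100.0 has to be ordered after 0.13.0.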
Enhancements
- kanjidist with approx = "pc" or approx = "pcweighted" runs considerably faster with the new kanjivec objects, because the inefficient (multiple) parsing of d attributes from previous versions is now avoided.
Bug fixes
- Generating the point cloud representations produced an error for some individual kanjivec objects. Fixed in the internal functions. Both kanjivec with a non-default bezier_discr parameter and kanjidist with approx = "pc" or approx = "pcweighted" should now run without problems in all cases (tested for Jouyou kanji).
kanjistat 0.12.0 (2024-05-02)
- kanjistat depends on R (>= 4.1) and transport (>= 0.15) now.
New features
- Function kanjidist has a new argument approx, which specifies how the strokes are to be approximated for computing component distances. The three options “grid”, “pc” and “pcweighted” work in any combination with the three options for the type argument (which now strictly specifies the type of distance used for the components).
- Function kanjivec has a new argument bezier_discr, which may be any of “svgparser”, “eqtimed” and “eqspaced”. It specifies which code is used to discretize the strokes in the stroketree component and according to which strategy the points are placed.
- Data set pooled_similarity contains the human similarity judgements of kanji from Yencken and Baldwin (2008).
kanjistat 0.11.0 (2024-02-17)
- Lennart Finke is now a co-author.
New features
- Function kanjidist accepts two new type arguments “pc” and “pcweighted” for computing component distances based on (weighted) point clouds rather than bitmap images.
- Data sets dstrokedit and dyehli added with stroke edit and Yeh-Li (bag-of-radicals) distances between Jouyou kanji and (usually a bit more than) their closest ten neighbors. Based on the PhD thesis by Lars Yencken (2010).
kanjistat 0.10.0 (2024-01-01)
kanjistat 0.9.0 (2023-05-20)
kanjistat 0.8.0 (2023-05-06)
Documentation
- More extensive readme file and main package vignette.
- Add package website using pkgdown.