Search for chaotic structures in Turkish and English texts by detrended fluctuation and time series analyses
Yeditepe University/Department of Information Technologies
Last modified: May 30, 2006
The possibility of chaotic structures in Turkish and English texts is investigated, together with the possibility of using the pseudo-invariants of a reconstructed phase space as identifying characteristics of a language. Texts of length up to 83000 in both languages have been analysed. Two alternatives for the dependent variable of the time series have been used. The first, inspired by Zipf's law, is the frequency of each word in a corpus. The second, inspired by a random walk, assigns values to the letters of each word. A positive maximal Lyapunov exponent has been observed, and its value differs between the two languages. This result, together with detrended fluctuation analysis results that differ between the two languages under either parametrization, implies that our analysis methods can point to differences between languages.
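As an illustration of the detrended fluctuation analysis mentioned above, the following is a minimal sketch of the standard first-order procedure: integrate the mean-subtracted series, remove a least-squares line from each non-overlapping window of size n, and estimate the scaling exponent as the slope of log F(n) against log n. It is not the authors' implementation. The function names, the window sizes, and the synthetic Gaussian input are assumptions made for the example; in the study the input would be the word-frequency or letter-based series, whose exact construction is not given here.

```python
import math
import random


def profile(series):
    """Cumulative sum of the mean-subtracted series (the DFA 'profile')."""
    mean = sum(series) / len(series)
    total, out = 0.0, []
    for x in series:
        total += x - mean
        out.append(total)
    return out


def linear_fit(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def fluctuation(prof, n):
    """Root-mean-square residual after linear detrending in windows of size n."""
    xs = list(range(n))
    squared, count = 0.0, 0
    # Non-overlapping windows; a trailing partial window is ignored.
    for start in range(0, len(prof) - n + 1, n):
        window = prof[start:start + n]
        slope, intercept = linear_fit(xs, window)
        squared += sum((y - (slope * x + intercept)) ** 2
                       for x, y in zip(xs, window))
        count += n
    return math.sqrt(squared / count)


def dfa_exponent(series, window_sizes):
    """Slope of log F(n) versus log n over the given window sizes."""
    prof = profile(series)
    log_n = [math.log(n) for n in window_sizes]
    log_f = [math.log(fluctuation(prof, n)) for n in window_sizes]
    slope, _ = linear_fit(log_n, log_f)
    return slope


if __name__ == "__main__":
    # Synthetic stand-in for a text-derived series (assumption, not real data).
    random.seed(0)
    noise = [random.gauss(0.0, 1.0) for _ in range(4096)]
    print("DFA exponent:", dfa_exponent(noise, [8, 16, 32, 64, 128]))
```

For uncorrelated noise the exponent is expected to lie near 0.5, and for an integrated random walk near 1.5, so different values for the two languages would indicate different correlation structure in the respective series.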