This page is under construction, whilst most of its content is password-protected.
Please get in touch for accessing the documents by sending a mail to – > email@example.com
Thank you for your patience.
Mining text in Chinese language can be quite challenging. For learning purposes, I developed a real-time RSS Chinese News Feed that can perform statistical analysis of unstructured data (word count, frequencies plots and clusters). The picture below shows successful encoding of linguistic … Continue reading →
Hello. 🙂 I am back with a post about forecasting, a subject I have always, carefully tried to avoid … for personal reasons. 😀 Forecasting is the science (or an art for some:) of analysing trends and estimating future outcome probabilities … Continue reading →
For streaming and handling financial data in R, we will need the libraries XML, and then work either with data.tables or timeseries (or other types). Here we look at both data.tables and timeseries. Run the following lines. In the line ‘url01 <- fread(“http://linktoyourquote“)’, change the … Continue reading →
The linguist who is not yet introduced to the universe of programming languages, may obviate with a ready-made software solution for computational linguistics. These can also be very useful in prototyping (or brainstorming phases of) automated analytic routines and software development. Wordsmith … Continue reading →
If one considers doing serious field and/or ethnographic work in social, interactional and behavioural sciences, one should consider getting acquainted with the softwares Elan developed by the Max Planck Institute for Psycholinguistics, Nijmegen (MPI Nijmegen henceforth), and Praat by (Boersma and Weenink, 2014). In my doctoral … Continue reading →