Meylan, S. C., & Griffiths, T. L. (preprint). The challenges of large-scale, web-based language datasets: Word length and predictability revisited. (link)
Nematzadeh, A., Meylan, S. C., & Griffiths, T. L. (2017). Evaluating vector-space models of word representation, or the unreasonable effectiveness of counting words near other words. Proceedings of the 39th Annual Conference of the Cognitive Science Society. (pdf)
Meylan, S. C., & Griffiths, T. L. (2015). A Bayesian framework for learning words from multiword utterances. Proceedings of the 37th Annual Conference of the Cognitive Science Society. (pdf)