[R-lang] Fitting a Zipf-Mandelbrot distribution

Ken Williams ken.williams at thomson.com
Tue May 29 09:14:41 PDT 2007


Excellent!  Thanks for the pointer.

 -Ken


On 5/25/07 5:13 PM, "Marco Baroni" <marco.baroni at unitn.it> wrote:

> Dear Ken,
> 
> Routines to fit a Zipf-Mandelbrot (probability) distribution are
> implemented in our zipfR library for R:
> 
> http://www.cogsci.uni-osnabrueck.de/~severt/zipfR
> 
> The library documentation also points to some literature on the topic
> (most notably, Baayens' 2001 Word Frequency Distributions book).
> 
> Regards,
> 
> Marco
> 
> 
> Ken Williams wrote:
>> Hi,
>> 
>> I've got some data (picture at http://limnus.com/~ken/zipf.jpg ) that seems
>> to very obviously follow a Zipf-Mandelbrot distribution, and I'm wondering
>> how to best fit the parameters of the distribution given the data.
>> 
>> I'm using the equation y = P(x + A)^-B as the canonical form of the
>> Zipf-Mandelbrot curve.
>> 
>> As a first attempt, I fixed the two endpoints (x=1 and y=1) to coincide with
>> the expected values on the graph, which analytically gives me values for P
>> and B as functions of A.  Then I twiddle around with values for A until it
>> looks nice.  On the attached graph, this means I end up with A=8.
>> 
>> This method pretty clearly overfits the endpoints, and is subject to the
>> whims of my eyeballs for A.  What would members of this list suggest for
>> alternative fitting methods?
>> 
>> Thanks,
>> 
>>  -Ken
>> 
>> _______________________________________________
>> R-lang mailing list
>> R-lang at ling.ucsd.edu
>> https://ling.ucsd.edu/mailman/listinfo.cgi/r-lang
>> 
> 



More information about the R-lang mailing list