We study the random variable Y-n representing the number of occurrences of a symbol a in a word of length n chosen at random in a regular language L subset of or equal to {a, b}*, where the random choice is defined via a non-negative rational formal series r of support L. Assuming that the transition matrix associated with r is primitive we obtain asymptotic estimates for the mean value and the variance of Y-n and present a central limit theorem for its distribution. Under a further condition on such a matrix, we also derive an asymptotic approximation of the discrete Fourier transform of Y-n that allows to prove a local limit theorem for Y-n. Further consequences of our analysis concern the growth of the coefficients in rational formal series; in particular, it turns out that, for a wide class of regular languages L, the maximum number of words of length n in L having the same number of occurrences of a given symbol is of the order of growth lambda(n)/rootn, for some constant lambda > 1.

On the number of occurrences of a symbol in words of regular languages / A. BERTONI, C. CHOFFRUT, M. GOLDWURM, V. LONATI. - In: THEORETICAL COMPUTER SCIENCE. - ISSN 0304-3975. - 302:1-3(2003), pp. 431-456.

On the number of occurrences of a symbol in words of regular languages

A. BERTONI;M. GOLDWURM
;
V. LONATI
2003

Abstract

We study the random variable Y-n representing the number of occurrences of a symbol a in a word of length n chosen at random in a regular language L subset of or equal to {a, b}*, where the random choice is defined via a non-negative rational formal series r of support L. Assuming that the transition matrix associated with r is primitive we obtain asymptotic estimates for the mean value and the variance of Y-n and present a central limit theorem for its distribution. Under a further condition on such a matrix, we also derive an asymptotic approximation of the discrete Fourier transform of Y-n that allows to prove a local limit theorem for Y-n. Further consequences of our analysis concern the growth of the coefficients in rational formal series; in particular, it turns out that, for a wide class of regular languages L, the maximum number of words of length n in L having the same number of occurrences of a given symbol is of the order of growth lambda(n)/rootn, for some constant lambda > 1.
automata and formal languages; discrete Fourier transform; Gaussian limit distribution; Perron-Frobenius theory; rational formal series
Settore INF/01 - Informatica
2003
Article (author)
File in questo prodotto:
File Dimensione Formato  
firenze_online.pdf

accesso aperto

Tipologia: Post-print, accepted manuscript ecc. (versione accettata dall'editore)
Dimensione 413.63 kB
Formato Adobe PDF
413.63 kB Adobe PDF Visualizza/Apri
tcs.pdf

accesso aperto

Tipologia: Publisher's version/PDF
Dimensione 350.83 kB
Formato Adobe PDF
350.83 kB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/4940
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 22
  • ???jsp.display-item.citation.isi??? 17
social impact