5 Another possibility would be to use EM (Dempster et al., 1977), but that would require learning both a generation system and an extraction system, which is clearly outside the scope of this thesis.