In this framework, you can think about a morpheme as being a tuple of features. ...

schoen · on Sept 22, 2019

Thanks, that's really neat (and since I know Latin, the analysis made perfect sense to me).

I suppose that you could, for example, account for the different conjugations and declensions by saying that they are also features of noun and verb stems that have to agree with endings that want to bind with them, right? Like "vid-" and "-et" is not just "sees - S.3" but also something like "see [+2conj]" and "S.3 [+2conj]" allowing them to bind with each other, where "-at" might be "S.3 [+1conj]" so it could bind with "am-" being "love [+1conj]", while "-et" doesn't bind with "am-" (except when interpreted as a different lexical item that adds [+subjunctive] to a [+1conj] stem?).

My next question is whether there are tools to facilitate writing parsers with this framework because it makes me want to write a Latin parser and see how well it does (and maybe how many formal syntactic ambiguities exist in Latin texts that we might not even notice most of the time).

jaclaz · on Sept 21, 2019

Yep, but, as soon as you exit very simple phrases subject/action/object, the approach may become complex to apply in practice, a couple (known) latin (tricky) examples (JFYI):

mala mala mala sunt bona

Soli soli soli

eindiran · on Sept 21, 2019

Unfortunately that's true, which is why linguistics is a field of study with its own journals, rather than something that can be summarized neatly in the space of an HN comment :P

This model really can account for quite complex language data though. For example, check out this account of auxiliary verbs in Basque: https://www.academia.edu/3112898/A_Distributed_Morphology_An...

Speaking to your examples: "mala mala mala sunt bona" isn't particularly difficult to analyze this way, you just need to realize that the "mala"s are different words (kind of like the famous English "Buffalo buffalo Buffalo buffalo buffalo buffalo Buffalo buffalo" sentence). If I remember the proverb correctly, it means "apples (mala) are good (sunt bona) for a painful jaw (mala mala)".

You need an analysis that allows adjectives to Merge with nouns iff they match case, gender and number, so that allows us to create a noun phrase "mala mala" in the instrumental ablative. Then you need a way to have the case, gender and number of subject bubble to the top of the phrase it will make with an auxiliary so that the adjective after the auxiliary is feature restricted to that case, gender and number. Once the elements of the auxiliary verb phrase have Merged, you get:

{{ mala, sunt }, bona }

Finally you have a rule that allows auxiliary verb phrases to Merge with noun phrases headed by an ablative. If you want the first "mala" to be the subject, then re-Merge it with the whole sentence so far, which in effect moves it to the top of the tree, leaving a trace in its original position.

I'm not sure what the second example means. My best guess is that it's the dative singular of 'sol', a matching masculine dative singular of 'solus' and a genitive singular of 'solum', so something like "for the only sun of the land". If that's correct, you need our previously used rule for Merging adjectives iff they match the noun in case, gender and number. Then you can add an additional rule that genitive nouns can be Merged with noun phrases (without any feature selection needing to take place) to form a new noun phrase.

Hopefully that shows that Merge and feature selection as mechanisms can be used outside of toy models, to actually account for real data.

jaclaz · on Sept 22, 2019

Your translations/guesses are correct, "mala mala mala sunt bona" is afaik an invented phrase, not entirely unlike "I Vitelli dei romani sono belli" (which is bilingual Latin/Italian, meaning in italian "The calves of the Romans are beautiful" but meaning in latin "Go, Vitellio at the sound of the Roman war god") to trick/have some fun of Latin students, while "Soli soli soli" was a phrase sometimes inscribed on sundials.

Anyway, yes, the Merge and feature can work just fine outside of "toy models" the note was about about they soon becoming complex.