Semantics in the Old English Poetic Line

Posted on March. 6.2024 by Stephen Harris

The case of Maldon

During an Independent Study on the Battle of Maldon this week, we noticed that in lines 109 and 110, the weapons of war were named in the third lift. The verbs were in the fourth:

grimme gegrundene garas fleogan

bogan wæran bysige bord ord onfeng.

A lift is another term for one of the four heavily weighted syllables in the OE poetic line. Whether some have more semantic force than others is a question raised by Professor Smirnitskaya of Moscow State University. Her student, Dr. Ilya Sverdlov of the Helsinki Institute of Advanced Study, gave a terrific paper on lifts and semantic force here at UMass many years ago.

I wrote a program to extract the third lift from every line. Recall that every OE poetic line has four major stresses, a caesura between the second and third stress, and alliteration across the caesura.

The program is in Python. For each line of the poem:

I remove OE stop-words
divide a line into half-lines (called the a-line and the b-line)
take the first letter of each word in the a-line in order to establish a pattern of alliteration in the b-line
return the third lift

At the moment, the third lift is unformatted. But I’d like to format it in color if it’s an alliterated lift. That’s for later. Also for later is adding some functionality so that this program can retrieve any lift from any poem along with the part of speech of that lift (e.g. “garas”, noun plural). First, the results. Then, the code. NB. Some of the results are inaccurate—I’ve marked those with a Kleene star.

1 ['not', 'line'], ['brocen', 'wurde.']	brocen
2 ['het', 'hyssa', 'hwæne'], ['hors', 'forlætan,']	hors
3 ['feor', 'afysan'], ['forð', 'gangan,']	forð
4 ['hicgan', 'handum'], ['hige', 'godum.']	hige
5 ['offan', 'mæg'], ['ærest', 'onfunde']	onfunde*
6 ['eorl', 'nolde'], ['yrhðo', 'geþolian,']	yrhðo
7 ['let', 'handon'], ['leofne', 'fleogan']	leofne
8 ['hafoc', 'holtes'], ['hilde', 'stop.']	hilde
9 ['man', 'mihte', 'oncnawan'], ['cniht', 'nolde']	cniht
10 ['wacian', 'wige'], ['wæpnum', 'feng.']	wæpnum
11 ['wolde', 'eadric'], ['ealdre', 'gelæstan']	ealdre
12 ['frean', 'gefeohte;'], ['ongan', 'forð', 'beran']	forð
13 ['gar', 'guþe.'], ['hæfde', 'god', 'geþanc']	god
14 ['handum'], ['healdan', 'mihte']	healdan
15 ['bord', 'brad', 'swurd;'], ['beot', 'gelæste']	beot
16 ['ætforan', 'frean'], ['feohtan', 'sceolde.']	feohtan
17 ['þær', 'byrhtnoð', 'ongan'], ['beornas', 'trymian,']	beornas
18 ['rad', 'rædde,'], ['rincum', 'tæhte']	rincum
19 ['hu', 'sceoldon', 'standan'], ['þone', 'stede', 'healdan']	stede
20 ['bæd', 'hyra', 'randas'], ['rihte', 'heoldon']	rihte
21 ['fæste', 'folman'], ['forhtedon', 'na.']	forhtedon
22 ['hæfde', 'folc'], ['fægere', 'getrymmed,']	fægere
23 ['lihte', 'leodon'], ['þær', 'leofost', 'wæs,']	leofost
24 ['þær', 'heorðwerod'], ['holdost', 'wiste.']	holdost
25 ['stod', 'stæðe,'], ['stiðlice', 'clypode']	stiðlice
26 ['wicinga', 'ar,'], ['wordum', 'mælde;']	wordum
27 ['beot', 'abead'], ['brimliþendra']	brimliþendra
28 ['ærænde', 'eorle'], ['þær', 'ofre', 'stod:']	þær*
29 ['sendon'], ['sæmen', 'snelle,']	sæmen
30 ['heton', 'secgan'], ['most', 'sendan', 'raðe']	sendan
31 ['beagas', 'gebeorge;'], ['betere', 'is']	betere
32 ['þisne', 'garræs'], ['gafole', 'forgyldon']	gafole
33 ['þonne', 'hearde'], ['hilde', 'dælon.']	hilde
34 ['þurfe', 'spillan'], ['spedaþ', 'þam;']	spedaþ
35 ['willað', 'golde'], ['grið', 'fæstnian.']	grið
36 ['gyf', 'gerædest'], ['her', 'ricost', 'eart']	her
37 ['þine', 'leoda'], ['lysan', 'wille,']	lysan
38 ['syllan', 'sæmannum'], ['hyra', 'sylfra', 'dom']	sylfra
39 ['feoh', 'freode'], ['niman', 'frið', 'us,']	frið
40 ['willaþ', 'sceattum'], ['scype', 'gangan,']	scype
41 ['flot', 'feran'], ['friþes', 'healdan.']	friþes
42 ['byrhtnoð', 'maþelode,'], ['bord', 'hafenode,']	bord
43 ['wand', 'wacne', 'æsc,'], ['wordum', 'mælde,']	wordum
44 ['yrre', 'anræd'], ['ageaf', 'andsware:']	ageaf
45 ['gehyrst', 'þu,', 'sælida,'], ['folc', 'segeð?']	segeð*
46 ['willað', 'gafole'], ['garas', 'syllan']	garas
47 ['ættrynne', 'ord'], ['ealde', 'swurd,']	ealde
48 ['heregeatu'], ['hilde', 'deah.']	hilde
49 ['brimmanna', 'boda,'], ['abeod', 'ongean:']	abeod
50 ['sege', 'þinum', 'leodum'], ['miccle', 'laþre', 'spell,']	laþre
51 ['her', 'stynt', 'unforcuð'], ['eorl', 'werode']	eorl
52 ['wile', 'gealgean'], ['eþel', 'þysne,']	eþel
53 ['æþelredes', 'eard'], ['ealdres', 'mines']	ealdres
54 ['folc', 'foldan.'], ['feallan', 'sceolon']	feallan
55 ['hæþene', 'hilde!'], ['heanlic', 'þinceð']	heanlic
56 ['urum', 'sceattum'], ['scype', 'gangon']	scype
57 ['unbefohtene,'], ['þus', 'feor', 'hider']	þus*
58 ['urne', 'eard'], ['becomon.']	becomon*
59 ['sceole', 'softe'], ['sinc', 'gegangan;']	sinc
60 ['sceal', 'ord', 'ecg'], ['geseman']	geseman
61 ['grim', 'guðplega'], ['gofol', 'syllon.']	gofol
62 ['het', 'bord', 'beran,'], ['beornas', 'gangan']	beornas
63 ['easteðe'], ['ealle', 'stodon.']	ealle
64 ['mihte', 'þær', 'wætere'], ['werod', 'oðrum;']	werod
65 ['þær', 'com', 'flowende'], ['flod', 'ebban,']	flod
66 ['lucon', 'lagustreamas.'], ['lang', 'þuhte']	lang
67 ['hwænne', 'togædere'], ['garas', 'beron.']	garas
68 ['þær', 'pantan', 'stream'], ['prasse', 'bestodon']	prasse
69 ['eastseaxena', 'ord'], ['æschere.']	æschere.
70 ['mihte', 'hyra', 'ænig'], ['oþrum', 'derian']	oþrum
71 ['flanes', 'flyht'], ['fyl', 'gename.']	fyl
72 ['flod', 'ut', 'gewat.'], ['flotan', 'stodon', 'gearowe']	flotan
73 ['wicinga', 'fela,'], ['wiges', 'georne.']	wiges
74 ['het', 'hæleða', 'hleo'], ['healdan', 'bricge']	healdan
75 ['wigan', 'wigheardne,'], ['wæs', 'haten', 'wulfstan,']	wæs
76 ['cafne', 'cynne;'], ['wæs', '?eolan', 'sunu']	?eolan
77 ['ðone', 'forman', 'man'], ['francan', 'ofsceat']	francan
78 ['þær', 'baldlicost'], ['bricge', 'stop.']	bricge
79 ['þær', 'stodon', 'wulfstane'], ['wigan', 'unforhte,']	wigan
80 ['ælfere', 'maccus,'], ['modige', 'twegen,']	modige
81 ['noldon', 'forda'], ['fleam', 'gewyrcan,']	fleam
82 ['fæstlice'], ['fynd', 'weredon']	fynd
83 ['wæpna'], ['wealdan', 'moston.']	wealdan
84 ['ongeaton'], ['georne', 'gesawon']	georne
85 ['þær', 'bricgweardas'], ['bitere', 'fundon,']	bitere
86 ['ongunnon', 'lytegian'], ['laðe', 'gystas:']	laðe
87 ['bædon', 'upgang'], ['agan', 'moston,']	agan
88 ['þone', 'ford', 'faran,'], ['feþan', 'lædan.']	feþan
89 ['eorl', 'ongan'], ['ofermode']	ofermode
90 ['alyfan', 'landes', 'fela'], ['laþere', 'ðeode.']	laþere
91 ['ongan', 'ceallian'], ['cald', 'wæter']	cald
92 ['byrhtelmes', 'bearn'], ['beornas', 'gehlyston:']	beornas
93 ['is', 'gerymed;'], ['gað', 'ricene', 'us,']	gað
94 ['guman', 'guþe.'], ['god', 'ana', 'wat']	god
95 ['wælstowe'], ['wealdan', 'mote.']	wealdan
96 ['wodon', 'wælwulfas'], ['wætere', 'murnon,']	wætere
97 ['wicinga', 'werod'], ['west', 'pantan,']	west
98 ['scir', 'wæter'], ['scyldas', 'wegon,']	scyldas
99 ['lidmen', 'lande'], ['linde', 'bæron.']	linde
100 ['þær', 'ongean', 'gramum'], ['gearowe', 'stodon']	gearowe
101 ['byrhtnoð', 'beornum;'], ['bordum', 'het']	bordum
102 ['wyrcan', 'þone', 'wihagan'], ['werod', 'healdan']	werod
103 ['fæste', 'feondum.'], ['wæs', 'feohte', 'neh']	feohte
104 ['tir', 'getohte.'], ['wæs', 'tid', 'cumen']	tid
105 ['þær', 'fæge', 'men'], ['feallan', 'sceoldon.']	feallan
106 ['þær', 'wearð', 'hream', 'ahafen.'], ['hremmas', 'wundon,']	hremmas
107 ['earn', 'æses', 'georn.'], ['wæs', 'eorþan', 'cyrm.']	eorþan
108 ['leton', 'folman'], ['feolhearde', 'speru,']	feolhearde
109 ['gegrundene'], ['garas', 'fleogan.']	garas
110 ['bogan', 'wæron', 'bysige;'], ['bord', 'ord', 'onfeng.']	bord
111 ['biter', 'wæs', 'beaduræs.'], ['beornas', 'feollon']	beornas
112 ['gehwæðere', 'hand,'], ['hyssas', 'lagon.']	hyssas
113 ['wund', 'wearð', 'wulfmær,'], ['wælræste', 'geceas']	wælræste
114 ['byrhtnoðes', 'mæg;'], ['billum', 'wearð']	billum
115 ['swuster', 'sunu'], ['swiðe', 'forheawen.']	swiðe
116 ['þær', 'wearð', 'wicingum'], ['wiþerlean', 'agyfen.']	wiþerlean
117 ['gehyrde', 'eadweard'], ['anne', 'sloge']	anne
118 ['swiðe', 'swurde,'], ['swenges', 'wyrnde']	swenges
119 ['fotum', 'feoll'], ['fæge', 'cempa.']	fæge
120 ['ðeoden'], ['þanc', 'gesæde']	þanc
121 ['burþene'], ['byre', 'hæfde.']	byre
122 ['stemnetton'], ['stiðhicgende']	stiðhicgende
123 ['hysas', 'hilde;'], ['hogodon', 'georne']	hogodon
124 ['þær', 'orde'], ['ærost', 'mihte']	ærost
125 ['fægean', 'men'], ['feorh', 'gewinnan,']	feorh
126 ['wigan', 'wæpnum.'], ['wæl', 'feol', 'eorðan.']	wæl
127 ['stodon', 'stædefæste;'], ['stihte', 'byrhtnoð,']	stihte
128 ['bæd', 'hyssa', 'gehwylc'], ['hogode', 'wige']	hogode
129 ['denon', 'wolde'], ['dom', 'gefeohtan.']	dom
130 ['wod', 'wiges', 'heard,'], ['wæpen', 'up', 'ahof,']	wæpen
131 ['bord', 'gebeorge,'], ['beornes', 'stop.']	beornes
132 ['eode', 'anræd'], ['eorl', 'ceorle;']	eorl
133 ['ægþer', 'hyra', 'oðrum'], ['yfeles', 'hogode.']	hogode*
134 ['sende', 'særinc'], ['suþerne', 'gar']	suþerne
135 ['gewundod', 'wearð'], ['wigena', 'hlaford.']	wigena
136 ['sceaf', 'scylde'], ['sceaft', 'tobærst']	sceaft
137 ['spere', 'sprengde'], ['sprang', 'ongean.']	sprang
138 ['gegremod', 'wearð', 'guðrinc;'], ['gare', 'stang']	gare
139 ['wlancne', 'wicing'], ['wunde', 'forgeaf.']	wunde
140 ['frod', 'wæs', 'fyrdrinc:'], ['let', 'francan', 'wadan']	francan
141 ['hysses', 'hals,'], ['hand', 'wisode']	hand
142 ['færsceaðan'], ['feorh', 'geræhte.']	feorh
143 ['oþerne'], ['ofstlice', 'sceat']	ofstlice
144 ['byrne', 'tobærst;'], ['wæs', 'breostum', 'wund']	breostum
145 ['hringlocan;'], ['heortan', 'stod']	heortan
146 ['ætterne', 'ord.'], ['eorl', 'wæs', 'bliþra;']	eorl
147 ['hloh', 'modi', 'man,'], ['sæde', 'metode', 'þanc']	metode
148 ['dægweorces'], ['drihten', 'forgeaf.']	drihten
149 ['forlet', 'drenga'], ['daroð', 'handa']	daroð
150 ['fleogan', 'folman'], ['forð', 'gewat']	forð
151 ['ðone', 'æþelan'], ['æþelredes', 'þegen.']	æþelredes
152 ['healfe', 'stod'], ['hyse', 'unweaxen,']	hyse
153 ['cniht', 'gecampe,'], ['full', 'caflice']	caflice
154 ['bræd', 'beorne'], ['blodigne', 'gar,']	blodigne
155 ['wulfstanes', 'bearn,'], ['wulfmær', 'geonga']	wulfmær
156 ['forlet', 'forheardne'], ['faran', 'ongean;']	faran
157 ['ord', 'gewod'], ['eorþan', 'læg']	eorþan
158 ['þeoden'], ['þearle', 'geræhte.']	þearle
159 ['eode', 'gesyrwed'], ['secg', 'eorle;']	eorle;
160 ['wolde', 'beornes'], ['beagas', 'gefetigan']	beagas
161 ['reaf', 'hringas'], ['gerenod', 'swurd.']	gerenod
162 ['byrhtnoð', 'bræd'], ['bill', 'sceðe']	bill
163 ['brad', 'bruneccg'], ['byrnan', 'sloh.']	byrnan
164 ['raþe', 'hine', 'gelette'], ['lidmanna']	lidmanna
165 ['eorles'], ['earm', 'amyrde.']	earm
166 ['feoll', 'foldan'], ['fealohilte', 'swurd;']	fealohilte
167 ['mihte', 'gehealdan'], ['heardne', 'mece,']	heardne,
168 ['wæpnes', 'wealdan.'], ['gyt', 'word', 'gecwæð']	word
169 ['har', 'hilderinc,'], ['hyssas', 'bylde,']	hyssas
170 ['bæd', 'gangan', 'forð'], ['gode', 'geferan;']	gode
171 ['mihte', 'fotum', 'leng'], ['fæste', 'gestandan.']	fæste
172 ['heofenum', 'wlat:'], ['no', 'half', 'line']	half
173 ['geþancie', 'þe,'], ['ðeoda', 'waldend,']	ðeoda
174 ['ealra', 'þæra', 'wynna'], ['worulde', 'gebad.']	worulde
175 ['ah,', 'milde', 'metod,'], ['mæste', 'þearfe']	mæste
176 ['minum', 'gaste'], ['godes', 'geunne']	godes
177 ['sawul'], ['siðian', 'mote']	siðian
178 ['geweald,'], ['þeoden', 'engla,']	þeoden
179 ['friþe', 'feran.'], ['eom', 'frymdi']	frymdi
180 ['helsceaðan'], ['hynan', 'moton.']	hynan
181 ['hine', 'heowon'], ['hæðene', 'scealcas']	hæðene
182 ['begen', 'beornas'], ['big', 'stodon:']	big
183 ['ælfnoð', 'wulmær'], ['begen', 'lagon']	begen
184 ['onemn', 'hyra', 'frean'], ['feorh', 'gesealdon.']	feorh
185 ['bugon', 'beaduwe'], ['þær', 'beon', 'noldon.']	beon
186 ['þær', 'wearð', 'oddan', 'bearn'], ['ærest', 'fleame']	ærest
187 ['godric', 'guþe'], ['þone', 'godan', 'forlet']	godan
188 ['mænigne'], ['mearh', 'gesealde.']	mearh
189 ['gehleop', 'þone', 'eoh'], ['ahte', 'hlaford']	ahte
190 ['gerædum'], ['riht', 'wæs,']	riht
191 ['broðru'], ['begen', 'ærndon,']	begen
192 ['godwine', 'godwig'], ['guþe', 'gymdon,']	guþe
193 ['wendon', 'wige'], ['þone', 'wudu', 'sohton,']	wudu
194 ['flugon', 'fæsten'], ['hyra', 'feore', 'burgon,']	feore
195 ['manna', 'ma'], ['þonne', 'ænig', 'mæð', 'wære']	mæð
196 ['gyf', 'geearnunga'], ['ealle', 'gemundon']	gemundon
197 ['duguþe'], ['gedon', 'hæfde.']	gedon
198 ['offa', 'dæg'], ['asæde']	asæde
199 ['meþelstede'], ['gemot', 'hæfde']	gemot
200 ['þær', 'modiglice'], ['manega', 'spræcon']	manega
201 ['þearfe'], ['þolian', 'noldon.']	þolian
202 ['wearð', 'afeallen'], ['folces', 'ealdor,']	folces
203 ['æþelredes', 'eorl;'], ['ealle', 'gesawon']	ealle
204 ['heorðgeneatas'], ['hyra', 'heorra', 'læg.']	hyra
205 ['ðær', 'wendon', 'forð'], ['wlance', 'þegenas,']	wlance
206 ['unearge', 'men'], ['efston', 'georne;']	efston
207 ['woldon', 'ealle'], ['oðer', 'twega,']	oðer
208 ['lif', 'forlætan'], ['oððe', 'leofne', 'gewrecan.']	leofne
209 ['bylde', 'forð'], ['bearn', 'ælfrices,']	bearn
210 ['wiga', 'wintrum', 'geong'], ['wordum', 'mælde;']	wordum
211 ['ælfwine', 'cwæð,'], ['ellen', 'spræc:']	ellen
212 ['gemunon', 'mæla'], ['meodo', 'spræcon,']	meodo
213 ['þonne', 'bence'], ['beot', 'ahofon']	beot
214 ['hæleð', 'healle,'], ['heard', 'gewinn.']	heard
215 ['mæg', 'cunnian'], ['cene', 'sy.']	cene
216 ['wylle', 'mine', 'æþelo'], ['eallum', 'gecyþan,']	eallum
217 ['wæs', 'myrcon'], ['miccles', 'cynnes:']	miccles
218 ['wæs', 'ealda', 'fæder'], ['ealhelm', 'haten,']	ealhelm
219 ['wis', 'ealdorman'], ['woruldgesælig.']	woruldgesælig.
220 ['sceolon', 'þeode'], ['þegenas', 'ætwitan']	þegenas
221 ['fyrde'], ['feran', 'wille,']	feran
222 ['eard', 'gesecan,'], ['ealdor', 'ligeð']	ealdor
223 ['forheawen', 'hilde.'], ['is', 'hearma', 'mæst:']	hearma
224 ['wæs', 'ægþer', 'mæg'], ['hlaford.']	hlaford.
225 ['forð', 'eode,'], ['fæhðe', 'gemunde,']	fæhðe
226 ['orde'], ['anne', 'geræhte']	anne
227 ['flotan', 'folce,'], ['foldan', 'læg']	foldan
228 ['forwegen', 'wæpne.'], ['ongan', 'winas', 'manian']	winas
229 ['frynd', 'geferan'], ['forð', 'eodon.']	forð
230 ['offa', 'gemælde,'], ['æscholt', 'asceoc:']	æscholt
231 ['þu,', 'ælfwine,', 'hafast'], ['ealle', 'gemanode']	ealle
232 ['þegenas', 'þearfe,'], ['þeoden', 'lið']	þeoden
233 ['eorl', 'eorðan.'], ['is', 'eallum', 'þearf']	eallum
234 ['æghwylc'], ['oþerne', 'bylde']	oþerne
235 ['wigan', 'wige'], ['wæpen', 'mæge']	wæpen
236 ['habban', 'healdan'], ['heardne', 'mece,']	heardne
237 ['gar', 'god', 'swurd.'], ['godric', 'hæfð,']	godric
238 ['earh', 'oddan', 'bearn,'], ['ealle', 'beswicene.']	ealle
239 ['wende', 'formoni', 'man,'], ['meare', 'rad']	meare
240 ['wlancan', 'wicge,'], ['wære', 'hlaford;']	wære
241 ['wearð', 'her', 'felda'], ['folc', 'totwæmed,']	folc
242 ['scyldburh', 'tobrocen.'], ['abreoðe', 'angin,']	abreoðe
243 ['her', 'manigne'], ['man', 'aflymde!']	man
244 ['leofsunu', 'gemælde'], ['linde', 'ahof,']	linde
245 ['bord', 'gebeorge;'], ['beorne', 'oncwæð:']	beorne
246 ['gehate'], ['heonon', 'nelle']	heonon
247 ['fleon', 'fotes', 'trym,'], ['wille', 'furðor', 'gan,']	furðor
248 ['wrecan', 'gewinne'], ['winedrihten.']	winedrihten.
249 ['þurfon', 'embe', 'sturmere'], ['stedefæste', 'hælæð']	stedefæste
250 ['wordum', 'ætwitan,'], ['wine', 'gecranc,']	wine
251 ['hlafordleas'], ['ham', 'siðie,']	ham
252 ['wende', 'wige,'], ['sceal', 'wæpen', 'niman']	wæpen
253 ['ord', 'iren.'], ['ful', 'yrre', 'wod,']	ful
254 ['feaht', 'fæstlice,'], ['fleam', 'forhogode.']	fleam
255 ['dunnere', 'cwæð,'], ['daroð', 'acwehte']	daroð
256 ['unorne', 'ceorl,'], ['eall', 'clypode,']	clypode,
257 ['bæd', 'beorna', 'gehwylc'], ['byrhtnoð', 'wræce:']	byrhtnoð
258 ['mæg', 'na', 'wandian'], ['wrecan', 'þenceð']	wrecan
259 ['frean', 'folce,'], ['feore', 'murnan.']	feore
260 ['forð', 'eodon,'], ['feores', 'rohton.']	feores
261 ['ongunnon', 'hiredmen'], ['heardlice', 'feohtan']	heardlice
262 ['grame', 'garberend'], ['god', 'bædon']	god
263 ['moston', 'gewrecan'], ['hyra', 'winedrihten']	hyra
264 ['hyra', 'feondum'], ['fyl', 'gewyrcan.']	fyl
265 ['gysel', 'ongan'], ['geornlice', 'fylstan:']	geornlice
266 ['wæs', 'norðhymbron'], ['heardes', 'cynnes,']	heardes
267 ['ecglafes', 'bearn,'], ['wæs', 'æscferð', 'nama.']	æscferð
268 ['wandode', 'na'], ['wigplegan,']	wigplegan,
269 ['fysde', 'forð'], ['flan', 'genehe.']	flan
270 ['hwilon', 'bord', 'sceat,'], ['hwilon', 'beorn', 'tæsde;']	hwilon
271 ['æfre', 'embe', 'stunde'], ['sealde', 'wunde']	sealde
272 ['wæpna'], ['wealdan', 'moste.']	wealdan
273 ['gyt', 'orde', 'stod'], ['eadweard', 'langa,']	eadweard
274 ['gearo', 'geornful'], ['gylpwordum', 'spræc']	gylpwordum
275 ['nolde', 'fleogan'], ['fotmæl', 'landes,']	fotmæl
276 ['bæc', 'bugan'], ['betera', 'leg.']	betera
277 ['bræc', 'þone', 'bordweall'], ['beornas', 'feaht']	beornas
278 ['sincgyfan'], ['sæmannum']	sæmannum
279 ['wurðlice', 'wrec'], ['wæle', 'læge.']	wæle
280 ['dyde', 'æþeric,'], ['æþele', 'gefera,']	æþele
281 ['fus', 'forðgeorn'], ['feaht', 'eornoste.']	feaht
282 ['sibyrhtes', 'broðor'], ['swiðe', 'mænig', 'oþer']	swiðe
283 ['clufon', 'cellod', 'bord,'], ['cene', 'weredon.']	cene
284 ['bærst', 'bordes', 'lærig,'], ['byrne', 'sang']	byrne
285 ['gryreleoða', 'sum.'], ['guðe', 'sloh']	guðe
286 ['offa', 'þone', 'sælidan'], ['eorðan', 'feoll,']	eorðan
287 ['ðær', 'gaddes', 'mæg'], ['grund', 'gesohte.']	grund
288 ['raðe', 'wearð', 'hilde'], ['offa', 'forheawen;']	offa
289 ['hæfde', 'geforþod'], ['frean', 'gehet']	gehet
290 ['beotode'], ['beahgifan']	beahgifan
291 ['sceoldon', 'begen'], ['burh', 'ridan']	burh
292 ['hale', 'hame,'], ['oððe', 'here', 'crincgan,']	here
293 ['wælstowe'], ['wundum', 'sweltan;']	wundum
294 ['læg', 'ðegenlice'], ['ðeodne', 'gehende.']	ðeodne
295 ['wearð', 'borda', 'gebræc.'], ['brimmen', 'wodon']	brimmen
296 ['guðe', 'gegremode;'], ['gar', 'þurhwod']	gar
297 ['fæges', 'feorhhus.'], ['forð', 'eode', 'wistan,']	forð
298 ['þurstanes', 'sunu'], ['secgas', 'feaht;']	secgas
299 ['wæs', 'geþrange'], ['hyra', 'þreora', 'bana']	hyra
300 ['wigelmes', 'bearn'], ['wæle', 'læge.']	wæle
301 ['þær', 'wæs', 'stið', 'gemot.'], ['stodon', 'fæste']	stodon
302 ['wigan', 'gewinne.'], ['wigend', 'cruncon']	wigend
303 ['wundum', 'werige.'], ['wæl', 'feol', 'eorþan.']	wæl
304 ['oswold', 'eadwold'], ['ealle']	ealle
305 ['begen', 'gebroþru'], ['beornas', 'trymedon,']	beornas
306 ['hyra', 'winemagas'], ['wordon', 'bædon']	wordon
307 ['þær', 'ðearfe'], ['þolian', 'sceoldon,']	þolian
308 ['unwaclice'], ['wæpna', 'neotan.']	wæpna
309 ['byrhtwold', 'maþelode,'], ['bord', 'hafenode']	bord
310 ['wæs', 'eald', 'geneat,'], ['æsc', 'acwehte;']	æsc
311 ['ful', 'baldlice'], ['beornas', 'lærde:']	beornas
312 ['hige', 'sceal', 'heardra,'], ['heorte', 'cenre,']	heorte
313 ['mod', 'sceal', 'mare'], ['mægen', 'lytlað.']	mægen
314 ['her', 'lið', 'ealdor'], ['eall', 'forheawen']	eall
315 ['god', 'greote.'], ['mæg', 'gnornian']	gnornian
316 ['þisum', 'wigplegan'], ['wendan', 'þenceð.']	wendan
317 ['eom', 'frod', 'feores;'], ['wille,']	wille,
318 ['healfe'], ['minum', 'hlaforde,']	hlaforde,
319 ['leofan', 'men'], ['licgan', 'þence.']	licgan
320 ['æþelgares', 'bearn'], ['ealle', 'bylde']	bylde
321 ['godric', 'guþe.'], ['gar', 'forlet']	gar
322 ['wælspere', 'windan'], ['wicingas;']	wicingas;
323 ['folce'], ['fyrmest', 'eode,']	fyrmest
324 ['heow', 'hynde'], ['hilde', 'gecranc.']	hilde
325 ['næs', 'na', 'godric'], ['guðe', 'forbeah']	guðe

And now the code.

First, the list of stop-words in Old English:

&
a
ac
æ
æfter
ær
ære
æt
after
and
ba
bæm
be
bi
binnan
bu
butan
buton
ða
ðæm
ðære
ðæs
ðæt
ðam
ðan
ðar
ðara
ðare
ðas
ðe
ðeah
ðenden
ðeos
ðes
ðider
ðin
ðinre
ðis
ðisra
ðisre
ðissa
ðisse
ðisses
ðissum
ðon
ðrie
ðritig
ðu
ðurh
ðy
ðys
eac
eala
eft
eow
eower
for
forþon
forðon
forðam
forþan

fram
from
ge
gea
geo
gif
git
he
heo
heom
heora
hi
hie
hiera
him
hiora
hira
hire
his
hit
hwa
hwæm
hwæs
hwæt
hwam
hwile
hwon
hwonne
hwy
ic
in
inc
incer
inne
iu
lice
me
mec
mid
midd
min
minne
ne
nu
oð
oðæt
oðat
oððæt
oððat
of
ofer
oft
on
ond
se
seo
siððan
siðþan
sum
sume
swa

swelce
swilce
swylce
sylfa
sylfe
sylfes
sylfum
to
unc
uncer
under
ure
us
we
wið
wit
ymbe
þa
þæm
þære
þæs
þæt
þam
þan
þar
þara
þare
þas
þat
þe
þeah
þenden
þeos
þes
þider
þin
þinre
þis
þisra
þisre
þissa
þisse
þisses
þissum
þon
þu
þurh
þy
þys

def flatten(mylist):
    flatlist = []
    for x in mylist:
        if type(x) == tuple or type(x) == list or type(x) == set:
            for y in x:
                flatlist.append(y)
        else:
            flatlist.append(x)

        if type(x) == list:
            x = flatten(x)
            continue

    return flatlist


def get_text(title):
    path = 'textsdata/'
    lines = []
    with open(path + title) as fh:
        temp_lines = fh.readlines()
        fh.close()
    for line in temp_lines:
        lines.append(line.rstrip(' \n'))
    return lines


def find_lifts(line):
    line_returns = []
    halflines = line.split('|')     # halfines[0] is the a-line, halflines[1] is the b-line
    try:
        a_half = halflines[0].split()
    except IndexError:
        a_half = None
    try:
        b_half = halflines[1].split()
    except IndexError:
        b_half = None
    if a_half:
        a_half = halflines[0].split()
        line_returns.append(remove_stopwords(a_half))
    if b_half:
        b_half = halflines[1].split()
        line_returns.append(remove_stopwords(b_half))

    return line_returns


def remove_stopwords(phrase: list):
    # use only after loading stopwords in MAIN
    phrase_return = []
    for word in phrase:
        if word not in stopwords:
            phrase_return.append(word)
    return flatten(phrase_return)


def get_alliteration(half: list) -> object:
    governing_letters = [w2[:1] for w2 in half[0]]      # from the first half
    for w3 in half[1]:
        if w3[:1] in governing_letters:
            return w3


with open('textsdata/oe_stopwords.txt', 'r') as fh2:
    stopword_temp = fh2.readlines()
    fh2.close()
stopwords = [w.rstrip('\n') for w in stopword_temp]

maldon_text = get_text('maldon_formatted.txt')

counter = 0
for maldon_line in maldon_text:
    counter += 1
    this_line = find_lifts(maldon_line)
    print(counter, this_line, end='\t\t')

    third = get_alliteration(this_line)
    if third is None:
        third = this_line[1][0]
    print(f'{third: >}')

If you would like the formatted text of Maldon please write me.

Bad Coding Jokes

Posted on April. 3.2019 by Stephen Harris

I’ll post more as I think of them. Meanwhile, solve for theta, launch the beta!

Q. What does a tired coder drink?
A. rpint()

About three-quarters there

Posted on December. 3.2018 by Stephen Harris

Screen shot 12/2/2018.You’re only as good as your data

That is the lesson here. Single brackets [x] indicate an entry in Ondrej Tichy‘s Bosworth-Toller, which I edited into a json file. Double brackets [[x]] indicate an entry in the raw data of Ondrej’s BT, if the word wasn’t found in the json file. Empty brackets indicate no returned value. A word like mæg can mean ‘may’ (V) or ‘kin’ (N). The word didn’t make the structured data, and the raw data mischaracterized it in its verbal form, so the parser didn’t pick up the verb.

Rather than spend days improving the data from Bosworth-Toller, or overwhelm the servers in Prague with BeautifulSoup requests, I’m going to scrape word lists from Old English sites, and OCR some glossaries from freely-available books. If I can compile 10 or 20 word lists and zip them to grammatical information, I can get a percentage of likelihood for any given word. Second, I can use the York-Helsinki Parsed Corpus of Aelfric’s prose through CLTK. It won’t catch all of the words, but might be a help.

I’ve written a simple script to inflect any noun or adjective and to conjugate any verb. I can work it backwards to find the root form of a word, then send that to BT.

Final step is to run the words and forms through a syntactic parser. If it sees ne, which carries a weight of 5, then it increases the likelihood that the next word is a verb, since negative particles almost always sit next to verbs in OE. (One can check that with a bigram search.) Similar proximity searches to prepositions, pronouns, and so forth help to assess weights (probabilities).

Once this next layer is completed, and the weights adjusted, I will have a decent control to check the more experimental parser.

Fulbright Project

Posted on October. 17.2018 by Stephen Harris

View from Dunton Tower at Carleton University looking north along the Rideau River towards the city of Ottawa.

I am very fortunate this year to have received a Fulbright award. The College of Humanities and Fine Arts at UMass made it possible for me to spend the academic year at Carleton University in Ottawa, Ontario, Canada. While here, I’m working on a natural-language parser of Old English, which I will use to create a semantic map of Old English nouns. In short, I want a computer to recognize an Old English noun and then find all words associated with it. Nouns are names for entities in the world. So a semantic map tells us something about how a language permits people to associate qualities with entities.

Following in the footsteps of Artificial Intelligence researchers like Waleed Ammar of the Paul Allen Institute, I will be using untagged corpora—that is, texts that no one has marked up for grammatical information. I would like to interfere with the data as little as possible.

What makes this project different from similar NLP projects is my aim. I want to produce a tool that can be used by literary critics. I am not interested in improving Siri or Alexa or a pop-up advertisement that wants to sell you shoes. Neither is my aim to propose hypotheses about natural languages, which is a general aim of linguistics-related NLPs. So, the object of my inquiry is artful writing, consciously patterned language.

STAGE ONE

The first stage is to write a standard NLP parser using tagged corpora. The standard parser will serve to check any results of the non-standard parser. Thanks to the generosity of Dr. Ondrej Tichý of Charles University in Prague, the standard parser is now equipped with a list of OE lexemes, parsed for form. A second control mechanism is the York-Helsinki Parsed Corpus of Old English, which is a tagged corpus of most of Aelfric’s Catholic sermons.

STAGE TWO

At the same time, I divided the OE corpus into genres. In poetic texts, dragons can breathe fire. But in non-fictional texts, dragons don’t exist. So a semantic field drawn around dragons will change depending on genre. I am subdividing the poetry according to codex, and then according to age (as far as is possible) to account for semantic shift. Those subdivisions will have to be revised, then abandoned as the AI engine gets running. (I’ll be using the python module fastai to implement an AI.)

Notes

Unicode. You’d think that searching a text string for another text string would be straightforward. But nothing is easy! A big part of preparing the Old English Corpus for manipulation is ensuring that the bits and bytes are in the right order. I had a great deal of difficulty opening the Bosworth-Toller structured data. It was in UTF-16 encoding, similar to the character encoding used on Windows machines. When I tried to open it via python, the interpreter threw an error. It turns out, Unicode is far more complex than one imagines. Although I can find ð and þ, for example, I cannot find them as the first characters of words after a newline (or even regex \b). Another hurdle.

Overcame it! The problem was in the data. For reasons unknown to me, Microsoft Windows encodes runic characters differently than expected. So the solution was to use a text editor (BB Edit), go into the data, and replace all original thorns with regular thorns. Same for eth, asc, and so forth. Weirdly, it didn’t look like I was doing anything: both thorns looked identical on the screen.

Screen shot of parser guts so far. Markup is data from Tichy’s Bosworth-Toller. Inflect receives Markup, then returns inflections based on the gender of strong nouns. Variants and special cases have not yet been included.

To finish STAGE ONE, I’ll now inflect every noun, pronoun, and adjective, then conjugate every verb as they come in (on the fly). Andrej Tichý at Charles University in Prague, who very generously sent me his structured data, took a slightly different approach: he generated all the permutations first and placed them into a list. Finally, as a sentence comes in, I’ll send off each word to the parser, receive its markup and inflections/conjugations, then search the markup for matches.

Square Roots

Posted on June. 5.2018 by Stephen Harris

My daughter and I were recently playing with python’s square root function. She discovered that if you evaluate an even number of ones, the square root is half that number of three’s on both sides of the decimal. So ?11 is approximately 3.3, and the ?1111 is approximately 33.33, and so forth. We learned that this continues until there are eight three’s on either side of the decimal point, then they reduce in frequency.

The square root of an odd number of ones is also patterned. ?1 is 1, ?111 is 10.5, ?11111 is 105.4, ?1111111 is 1054.0, and so forth.

So we decided to write a python program to generate 20 instances. Here is the program:

#! /usr/bin/env python3
“””Determines the square roots of numbers comprised of ones like 11, 111, 1111, etc.”””
import math
bobby = 10
sue = 1
for x in range(1,20):
answer = bobby + sue
sue = answer
bobby = bobby*10
print(x+1, ‘\tThe square root of ‘, answer, ‘ is ‘, math.sqrt(answer))

And here are the answers:

2 The square root of 11 is 3.3166247903554
3 The square root of 111 is 10.535653752852738
4 The square root of 1111 is 33.331666624997915
5 The square root of 11111 is 105.40872829135166
6 The square root of 111111 is 333.333166666625
7 The square root of 1111111 is 1054.0925006848308
8 The square root of 11111111 is 3333.3333166666666
9 The square root of 111111111 is 10540.925528624135
10 The square root of 1111111111 is 33333.33333166667
11 The square root of 11111111111 is 105409.25533841894
12 The square root of 111111111111 is 333333.33333316667
13 The square root of 1111111111111 is 1054092.553389407
14 The square root of 11111111111111 is 3333333.3333333167
15 The square root of 111111111111111 is 10540925.533894593
16 The square root of 1111111111111111 is 33333333.333333332
17 The square root of 11111111111111111 is 105409255.33894598
18 The square root of 111111111111111111 is 333333333.3333333
19 The square root of 1111111111111111111 is 1054092553.3894598
20 The square root of 11111111111111111111 is 3333333333.3333335

Although it looks like the sixes also multiply, they also reduce after reaching eight in a row. Check it out with python’s decimal package. from decimal import Decimal, then in the print statement, add Decimal(math.sqrt(answer)).

Free Will

Posted on December. 7.2017 by Stephen Harris

Wednesday 6 December: a very exciting discussion about free will put on by the Erasmus Center. Sincere thanks to Jim Holden and to Erasmus for inviting me to respond to Peter Tse, author of The Neural Basis of Free Will (MIT Press, 2013).

My main point during the debate was that standards of proof and acceptable methods of testing are not yet available to neuro-scientists to establish a physiological basis of free will. Study of the neuron is the province of bio-chemistry, which has its own standards of proof and acceptable methods of testing. These standards have been developed over decades, through argument and counter-argument, and through experimentation. They are not optional—not if you seek accurate results. Freedom is a concept discussed for centuries by philosophers, theologians, political scientists, and historians. Each of those fields has its own standards of proof and acceptable methods of argumentation. Those standards are important to ensuring logical results. Will or volition is chiefly the province of psychology, with its own standards of proof and acceptable methods of testing. So bringing bio-chemical evidence to a philosophical debate about a psychological topic seems to me to be like, as Laurie Anderson said, trying to dance architecture.

A secondary point I made was that any logical investigation proceeds from the question that you set. So, setting the question correctly is essential. We would not have had a debate had Dr. Tse written a book entitled, The Neural Basis of Unconstrained Choice. The phrase “free will” connotes something in English that the phrases “unconstrained choice” or “unfettered desire” do not. [It seemed to me an example of the Sapir-Whorf fallacy that the English language should be transparent to physical reality, specifically to neural pathways.] So, I tried to show how desire is different from will in English, how French and Latin are different again, and how investigating free will in English entails different logical assumptions than investigating it in French or Latin. In English, will connotes desire, want, action. In French, arbitre connotes sight, judgment, observation. Different semantic fields with little overlap. Another example: the greatest virtue according to Christians is love. That’s English. In the Latin Bible, the word is caritas. You can also translate caritas as charity (faith, hope, and charity). You can give charity without being in love, such as for tax purposes. So which one is the virtue? Faith in the Latin Bible is fides, which can also be translated as loyalty. Which is it? There’s a big difference between obeying someone that you don’t believe in and believing someone whom you don’t obey. Same for freedom. The French prize Liberté, or liberty. Would Dr. Tse have found the same things if he looked for liberty of desire? A French-speaker who succumbed to Sapir-Whorf would look for la libre volunté in a different part of the brain than an English-speaker. And that makes no sense to me!

I also made the case that, as Gertrude Stein said of Oakland, “there’s no there there.” “Free will” is a concept that English speakers use to talk about a whole host of connected ideas and psychological processes. Free will is not a thing. It doesn’t exist the way Plymouth Rock or the Boston Marathon exist. Where do you find free will? I say, in a dictionary.

The public discussion among the guests afterwards was terrific. No one in the room doubted that the brain is essential to thinking. But there seemed a general consensus that thought is not reducible to bio-chemistry. Some people made the point that our morality and personal values depend upon a non-reductive view, on a non-physicalist view, of will. Others said that there are psychological responses that we think are free, but are actually conditioned or instinctive. So we have to distinguish the choices that are free from those that are not. Others asked whether or not free will introduces randomness into science, and if so, to what degree. (I tend to think that decisions are not made randomly, but on the basis of stochastic algorithms that measure optimality by accounting for values, external conditions, imagined results, and so forth.) What was most apparent to me is that neuro-science is not going to trump dozens of disciplines, centuries of carefully thought-out positions, and carefully considered, methodical experimentation. It reaffirmed my faith in the multiplicity of a university, of a fundamental need for diversity of viewpoints, all speaking with each other, with each one grounded in a distinct intellectual tradition.

Bede and Verlaine

Posted on October. 30.2017 by Stephen Harris

I just stumbled across Verlaine’s Bonheur, which he described as part of a Catholic triptych. He wrote it in the late 1880s and early 1890s, finishing the manuscript in January 1891. Poem 28 is in the form of an epanaleptic elegy, like Bede’s Hymn to Aethelthryth, modified into an ABABA rhyme. The first line matches the last. Here are the first two stanzas:

Les plus belles voix
De la Confrérie
Célèbrant le mois
Heureux de Marie.
O les douces voix!

Monsiuer le Curé
L’a dit à la Messe:
C’est le mois sacré.
Écoutons sans cesse
Monsieur le Curé.

I’m struck by how tenacious is the poetic form. Not only does the form allude to Latin elegies of the Church, but it also requires the poet to repeat a phrase in different contexts—which is a practice in prayer. Repetition in slightly different contexts allows the poet to restore a reader’s wandering imagination to a fixed narrative while serially enlarging the semantic force of the narrative. So we meet fairly cliché images: voices of monastery, a curate saying Mass. But slight variation and addition of phrases moves our imagination from topic to topic, connotation to connotation, and thereby gives fuller character to otherwise dry words. Verlaine focuses our attention first on superlative beauty, a characteristic we then apply to voices. We follow those voices to a confraternity. The verb célèbrant is in the plural, which places an image of celebration in our minds, but does not assign it to the singular confrérie. Instead, it is the beautiful voices that celebrate.

Next we greet the singular mois ‘month’, which until the /s/ at the end of the line leaves us in suspense, wondering if will end moine ‘monk’. The masculine mois is set in parallel with the previous line’s feminine Confrérie, indicating in part the capaciousness of the community, like a month that contains many days. A natural caesura at the end of the third line interrupts the semantic flow of the Noun + Adjective (mois heureux). This pause sets the initial words of line 3 and 4 in apposition: celebrate and joyful. And now, a bit of genius! Verlaine wraps a singular masculine phrase (le mois hereux) with two singular feminine nouns, confrérie and Marie. Here, one is reminded of Bede’s description of Caedmon’s monastery at Whitby where a community of men is ruled by a woman, Hild, and the Wisdom of God—in Latin, Wisdom is feminine.

After the celebratory kernel, we finish the wrapping with a transformed choir of voices. At first the most beautiful, now, transformed through their celebration of Mary, they are gentle and sweet. Similar transformation will take place with the mois ‘month’ which begins joyfully, then transforms in the second stanza to sacred, sacré.

This is a poem of nine stanzas. The number is not insignificant, as it is the number of months Christ was in the womb, the number of months it takes for the physical body to gain its spiritual soul, and perhaps the number of stanzas it takes for a reader suffused with the physical beauty of Verlaine’s verse to achieve a spiritual understanding.

Anglo-Saxon Lyre 4 (New Lyre)

Posted on September. 16.2017 by Stephen Harris

The Serpentine Lyre.

The second lyre is named for the interlace pattern on its headstock. The last lyre had two dragon heads, so it’s the Dragon-head lyre. Simple. Number Two is made completely of mahogany. The Sutton Hoo lyre was oak and had no sound holes. So, it seems as if the original makers relied on the sonority of the box itself to carry the sound waves. That means paying close attention to grain direction and keeping the joints tight. I decided on a frame, much like a tortion box, but with the head stock exposed.

The frame is 3/4″ x 3/4″ mahogany. There are two main points of potential warp: torque on the cross-beam and tilt on the base. The headstock is so heavy and the string pins so thick that tilt is unlikely there. So, joinry type will be chosen to account for that.

Click on any picture to enlarge it.

The main crossbrace is joined with a through-tenon. If it is seated in the mortise well, then the brace won’t tilt (or roll) forward or backward, causing the lyre’s skin to buckle.

You can see that it’s pretty well seated. There’s a little daylight, but I filled that with epoxy. All the main structural joints are epoxied. The base of the lyre is going to anchor the force of the strings, like the anchorage of the Brooklyn Bridge. So the main counterforce, it seems, has to be against tilt (or roll). So I decided on a bridle joint. I also wanted a lot of exposed long grain for glue.

I kept the proportions of the joint to thirds, which left enough wood to sustain the pressure. Even with as hard a wood as mahogany, I wouldn’t make the frame any thinner. Next, I wanted to reinforce the base with a secondary brace that would also support the bridge. To counteract both tilt and torque, I decided on a half-lap dovetail to secure the brace to the sides, and mortise and tenon to secure the stiles.

Here you can see the seating for the half-lap dovetail:

It’s a lot of chisel work! But mahogany is gorgeous to work with. Here is the entire base architecture just before glue-up:

I used regular wood glue for the crossbrace assembly, since the joints are doing most of the work.

But just to be sure, I added dowels:

In the picture above, you can also see the mahogany skin. I skinned one side with 1/16″ mahogany veneer. It was too thin to support anything structurally, so I put nto a series of very thin supports. These were walnut. I wanted something with closer grain than mahogany, both to avoid splintering and to carry high-frequency sound more efficiently.

Here is the frame with one side of skin:

After all that work and all those beautiful joints, one is tempted to show it off–very Greene & Greene! But a clean aesthetic demands otherwise, so I hid the joinery. I don’t have pictures of carving the headstock, but you can see it in the following video. (I made the video so you can hear the sound.) The saddle is a strip of walnut that spans the mahogany skin and supports a mahogany bridge. The tailpiece is mahogany and clutches the body like a c-clamp. The strings are sent through holes and their ends are tied, like a classical guitar. The tuning pegs are zither pegs, tuned with a handle.

The tuning is CDGCDG. I came up with the rhythm and melody so that the music would match the syllabification of Caedmon’s Hymn, caesurae included. Here are the first four lines (click link to watch video):

SerpentineLyre Small

Old English Parser 6

Posted on August. 7.2017 by Stephen Harris

Trinity College Library

Thanks to the generosity of the College of Humanties and Fine Arts at UMass, the parser is now working at a very basic level. Here it is: http://www.bede.net/misc/dublin/parse.html

Mark Faulkner of Trinity College Dublin aided by the generosity of the Irish Research Council brought me to Trinity in July to describe the parser. He hosted a terrific conference where a wide variety of scholars and scientists presented big-data projects. One conclusion from my perspective was that medieval texts (as well as all ancient languages) are as subject to the Law of Large Numbers as are network packets, social media posts, and traffic patterns. One surprise was how few samples were needed.

Specifically, texts (that is, generically similar sets of syntactically correct utterances) will approach audience expections as defined by genre, language, and period. These expectations are in turn characterized by sets of related vocabulary items. For example, a sermon employs certain words to indicate to an audience that it is indeed a sermon—or conversely, it is by virtue of certain vocabulary items that we classify a text as a sermon. (See Michael Drout, Lexomics.) Moreover, those vocabulary items tend ot come in serially-arranged chunks so that, as an extreme example, an adventure story may use words to describe setting and character before using words that describe dying.

Consequently, a parser that seeks to classify for genre or for aesthetic particulars considers not only vocabulary, but related vocabulary items, and the order in which those items appear over the span of the text. That parser will also consider phrases and phrase structure. Old English poetry, once recognized through its combination of rhetorical tropes (alliteration, hypotaxis, etc.), can be parsed metrically. More importantly, word combinations and clusters can be searched for and a stochastic algorithm applied in order to yield high-frequency clusters. That algorithm is the challenge. Although Google has developed similar algorithms to rank websites and to effect advertiser auctions, deciding on the likelihood of a grammatical claim is a different problem. My challenge over the next year or so is to break down the assumptions a fluent reader of Old English makes when reading a text. The biggest roadblock will be the desire to put in my own grammatical knowledge. A Natural Language Parser does not rely on a grammarian’s parse of a text. It relies instead on all texts ever written in that language.

The most exciting possibility to my mind is using glosses to tie OE literature into its Latin and Celtic analogues. A gloss is a single-word translation of one language into another. For example, French eau can be glossed by English water. Old English scribes glossed many Latin words. They wrote them in tiny letters above the Latin. Using these glosses, we could connect OE texts to Latin texts more closely and trace the migration and adaptation of images and collocations over time. Paired with information about the movement of manuscripts, we could map the dispersal of ideas, images, and metaphors over time and space.

Because some of these metaphors constitute a defining characteristics of a genre (such as lyric), we can watch the evolution of genre over time. And by examining the structure and constitution of these text in multiple languages, we could observe the interrelation and mutual influence of written culture.

The bottom line? In the next stage, I have to treat a text like an organism. Ask what other organisms are like it. Then try to dissect their DNA and determine which genetic markers came from where.

Perl Calendar > HTML

Posted on August. 7.2017 by Stephen Harris

This perl script generates the HTML for a simple 12-month calendar. Run it in Terminal: it will ask which day of the week is the first of January. Then, copy-and-paste the results to your HTML code. Modify the script as you like.

#usr/bin/perl -w

# generate a calendar for HTML
# sharris@umass[dot]edu 2017

# ___________________________________________ declarations

my $start = 0;
my @monthNames = qw(January February March April May June July August September October November December );
my @month = qw(31 28 31 30 31 30 31 31 30 31 30 31);
my @dayNames = qw(Su M Tu W Th F Sa);

my $table = “<table width=’200′ border=’1′ cellpadding=’2′ cellspacing=’1′ bordercolor=’#000000′>”;
my $tr = “<tr align=’center’ valign=’middle’>”;

my $weekday = 0; # which day of the week is it?
my $date = 1; # what date is it?
my $m = 0; # total months for giant loop offset by 1
my $offset = 0; # offset counter
my $i = 7; # 7 days in a week

# ____________________________________________ setup
print “\nOn which day of the week is the first of January? (M T W H F S U)”;
$start = <STDIN>;

# now we set the week-counter to the day of the week minus 1 as offset

if ($start == “M”){$offset = 1;}
if ($start == “T”){$offset = 2;}
if ($start == “W”){$offset = 3;}
if ($start == “H”){$offset = 4;}
if ($start == “F”){$offset = 5;}
if ($start == “S”){$offset = 6;}
if ($start == “U”){$offset = 0;}

# ___________________________________________ BIG LOOP

foreach (@monthNames) {
print “<p><b>”.$_.”</b></p>\n”;
print $table;
print $tr;

foreach (@dayNames){
print “\n<td bgcolor=’#CCCCCC’><b>”.$_.”</b></td>”;
}
print “</tr>\n”;

$date = $month[$m] – ($month[$m] + $offset) + 1;

until ($date > $month[$m]){
for ($i = 7; $i > 0; $i–){
if ($i == 7){print “<tr align=’center’ valign=’middle’>”;}
print “<td bgcolor=’#CCCCCC’>”;
if ($date > 0 && $date <= $month[$m]){print $date;} else {print “\&nbsp\;”;}
print “</td>\n”;
if ($i == 1){print “</tr>”;}

if ($date == $month[$m]){$offset = 8 – $i;}
$date++;
} # weekly loop

} # fill loop
print “</table>”;
$m++;
} # monthly loop

launchd -harris

background processes

Category Archives: Uncategorized