Getting Heads 2¶

prep in

83635

מֶ֤לֶךְ

subs king st=c

83636

מֹואָב֙

nmpr Moab st=a

83637

art the

83638

רִאשֹׁ֔ון

adjv first st=a

Result 2

2_Samuel 24:9

753091

phrase 753091 PreC NP

175628

שְׁמֹנֶה֩

subs eight st=a

175629

מֵאֹ֨ות

subs hundred st=a

175630

אֶ֤לֶף

subs thousand st=a

175631

אִֽישׁ־

subs man st=a

175632

חַ֨יִל֙

subs power st=a

Result 3

Daniel 7:25

879444

phrase 879444 Time PP

375101

prep until

375102

עִדָּ֥ן

subs time st=a

375103

conj and

375104

עִדָּנִ֖ין

subs time st=a

375105

conj and

375106

פְלַ֥ג

subs half st=c

375107

עִדָּֽן׃

subs time st=a

Result 4

Judges 9:1

726169

phrase 726169 Cmpl PP

132881

אֲלֵיהֶ֔ם

prep to

phrase 726169 Cmpl PP|CP

132882

conj and

phrase 726169 Cmpl PP

132883

prep to

132884

subs whole st=c

132885

מִשְׁפַּ֛חַת

subs clan st=c

132886

subs house st=c

132887

אֲבִ֥י

subs father st=c

132888

אִמֹּ֖ו

subs mother st=a

Result 5

1_Kings 5:3

755341

phrase 755341 PreC NP

179384

עֲשָׂרָ֨ה

subs ten st=a

179385

בָקָ֜ר

subs cattle st=a

179386

בְּרִאִ֗ים

adjv fat st=a

179387

conj and

179388

subs twenty st=a

179389

בָּקָ֛ר

subs cattle st=a

179390

רְעִ֖י

subs pasture st=a

179391

conj and

179392

מֵ֣אָה

subs hundred st=a

179393

צֹ֑אן

subs cattle st=a

Result 6

Proverbs 21:23

865306

phrase 865306 Subj NP

352627

שֹׁמֵ֣ר

subs keep qal ptca st=c

352628

פִּ֭יו

subs mouth st=a

352629

conj and

352630

לְשֹׁונֹ֑ו

subs tongue st=a

Result 7

Exodus 38:31

682106

phrase 682106 Objc PP

51034

prep <object marker>

51035

אַדְנֵ֤י

subs pedestal st=c

51036

art the

51037

חָצֵר֙

subs court st=a

phrase 682106 Objc PP|AdvP

51038

סָבִ֔יב

advb surrounding st=a

phrase 682106 Objc PP|CP

51039

conj and

phrase 682106 Objc PP

51040

prep <object marker>

51041

אַדְנֵ֖י

subs pedestal st=c

51042

subs gate st=c

51043

art the

51044

חָצֵ֑ר

subs court st=a

51045

conj and

51046

אֵ֨ת

prep <object marker>

51047

subs whole st=c

51048

יִתְדֹ֧ת

subs peg st=c

51049

art the

51050

מִּשְׁכָּ֛ן

subs dwelling-place st=a

51051

conj and

51052

prep <object marker>

51053

subs whole st=c

51054

יִתְדֹ֥ת

subs peg st=c

51055

art the

51056

חָצֵ֖ר

subs court st=a

phrase 682106 Objc PP|AdvP

51057

סָבִֽיב׃

advb surrounding st=a

Result 8

1_Chronicles 17:17

891630

phrase 891630 Cmpl PP

400585

כְּ

prep as

400586

תֹ֧ור

subs turn st=c

400587

art the

400588

אָדָ֛ם

subs human, mankind st=a

400589

art the

400590

מַּעֲלָ֖ה

subs ascent st=a

Result 9

Exodus 40:6

682452

phrase 682452 Cmpl PP

51946

prep to

51947

פְנֵ֕י

subs face st=c

51948

פֶּ֖תַח

subs opening st=c

51949

מִשְׁכַּ֥ן

subs dwelling-place st=c

51950

אֹֽהֶל־

subs tent st=c

51951

מֹועֵֽד׃

subs appointment st=a

Result 10

Leviticus 13:59

686898

phrase 686898 PreC NP

60185

תֹּורַ֨ת

subs instruction st=c

60186

נֶֽגַע־

subs stroke st=c

60187

צָרַ֜עַת

subs skin-disease st=c

60188

בֶּ֥גֶד

subs garment st=c

60189

art the

60190

צֶּ֣מֶר׀

subs wool st=a

60191

conj or

60192

art the

60193

פִּשְׁתִּ֗ים

subs flax st=a

60194

אֹ֤ו

conj or

60195

art the

60196

שְּׁתִי֙

subs texture st=a

60197

conj or

60198

art the

60199

עֵ֔רֶב

subs woof st=a

60200

אֹ֖ו

conj or

60201

subs whole st=c

60202

subs tool st=c

60203

עֹ֑ור

subs skin st=a

Result 11

702367

phrase 702367 Objc NP

subs whole st=c

subs garment st=a

conj and

subs whole st=c

subs tool st=c

subs skin st=a

conj and

subs whole st=c

subs deed st=c

subs goat st=a

conj and

subs whole st=c

subs tool st=c

subs tree st=a

Result 12

2_Chronicles 30:12

902589

phrase 902589 Objc NP

422131

מִצְוַ֥ת

subs commandment st=c

422132

art the

422133

מֶּ֛לֶךְ

subs king st=a

422134

conj and

422135

art the

422136

שָּׂרִ֖ים

subs chief st=a

Result 13

2_Kings 12:13

769247

phrase 769247 Objc NP

202861

עֵצִים֙

subs tree st=a

202862

conj and

202863

אַבְנֵ֣י

subs stone st=c

202864

מַחְצֵ֔ב

subs hewn stone st=a

Result 14

2_Kings 23:26

773413

phrase 773413 Cmpl PP

210599

prep from

210600

חֲרֹ֤ון

subs anger st=c

210601

אַפֹּו֙

subs nose st=a

210602

art the

210603

גָּדֹ֔ול

adjv great st=a

Result 15

Numbers 4:26

693340

phrase 693340 Objc PP

72277

אֵת֩

prep <object marker>

72278

קַלְעֵ֨י

subs curtain st=c

72279

art the

72280

חָצֵ֜ר

subs court st=a

72281

conj and

72282

prep <object marker>

72283

מָסַ֣ךְ׀

subs covering st=c

72284

פֶּ֣תַח׀

subs opening st=c

72285

subs gate st=c

72286

art the

72287

חָצֵ֗ר

subs court st=a

phrase 693340 Objc PP|CP

72297

conj and

phrase 693340 Objc PP

72298

אֵת֙

prep <object marker>

72299

מֵֽיתְרֵיהֶ֔ם

subs string st=a

72300

conj and

72301

prep <object marker>

72302

subs whole st=c

72303

כְּלֵ֖י

subs tool st=c

72304

עֲבֹדָתָ֑ם

subs work st=a

72305

conj and

72306

אֵ֨ת

prep <object marker>

72307

subs whole st=c

Result 16

702367

phrase 702367 Objc NP

subs whole st=c

subs garment st=a

conj and

subs whole st=c

subs tool st=c

subs skin st=a

conj and

subs whole st=c

subs deed st=c

subs goat st=a

conj and

subs whole st=c

subs tool st=c

subs tree st=a

Result 17

1_Kings 5:10

755396

phrase 755396 Adju PP

179543

מֵֽ

prep from

179544

חָכְמַ֖ת

subs wisdom st=c

179545

subs whole st=c

179546

בְּנֵי־

subs son st=c

179547

קֶ֑דֶם

subs front st=a

179548

conj and

179549

prep from

179550

כֹּ֖ל

subs whole st=c

179551

חָכְמַ֥ת

subs wisdom st=c

179552

מִצְרָֽיִם׃

nmpr Egypt st=a

Result 18

Jeremiah 11:8

792948

phrase 792948 Cmpl PP

240501

בִּ

prep in

240502

שְׁרִיר֖וּת

subs stubbornness st=c

240503

לִבָּ֣ם

subs heart st=a

240504

art the

240505

רָ֑ע

adjv evil st=a

Result 19

Isaiah 21:17

778315

phrase 778315 Subj NP

218969

שְׁאָ֧ר

subs rest st=c

218970

מִסְפַּר־

subs number st=c

218971

קֶ֛שֶׁת

subs bow st=c

218972

גִּבֹּורֵ֥י

subs vigorous st=c

218973

בְנֵֽי־

subs son st=c

218974

קֵדָ֖ר

nmpr Kedar st=a

Result 20

Exodus 39:40

682381

phrase 682381 Objc PP

51772

אֵת֩

prep <object marker>

51773

קַלְעֵ֨י

subs curtain st=c

51774

art the

51775

חָצֵ֜ר

subs court st=a

51776

prep <object marker>

51777

עַמֻּדֶ֣יהָ

subs pillar st=a

51778

conj and

51779

prep <object marker>

51780

אֲדָנֶ֗יהָ

subs pedestal st=a

51781

conj and

51782

prep <object marker>

51783

art the

51784

מָּסָךְ֙

subs covering st=a

phrase 682381 Objc PP

51785

prep to

51786

subs gate st=c

51787

art the

51788

חָצֵ֔ר

subs court st=a

phrase 682381 Objc PP

51789

prep <object marker>

51790

מֵיתָרָ֖יו

subs string st=a

51791

וִ

conj and

51792

יתֵדֹתֶ֑יהָ

subs peg st=a

51793

conj and

51794

אֵ֗ת

prep <object marker>

51795

subs whole st=c

51796

כְּלֵ֛י

subs tool st=c

51797

עֲבֹדַ֥ת

subs work st=c

51798

art the

51799

מִּשְׁכָּ֖ן

subs dwelling-place st=a

phrase 682381 Objc PP

51800

prep to

51801

אֹ֥הֶל

subs tent st=c

51802

מֹועֵֽד׃

subs appointment st=a

Result 21

1_Chronicles 9:29

889526

phrase 889526 Cmpl PP

396458

prep upon

396459

art the

396460

כֵּלִ֔ים

subs tool st=a

396461

conj and

396462

עַ֖ל

prep upon

396463

subs whole st=c

396464

כְּלֵ֣י

subs tool st=c

396465

art the

396466

קֹּ֑דֶשׁ

subs holiness st=a

396467

conj and

396468

prep upon

396469

art the

396470

סֹּ֨לֶת֙

subs wheat groat st=a

phrase 889526 Cmpl PP|CP

396471

conj and

phrase 889526 Cmpl PP|NP

396472

art the

396473

יַּ֣יִן

subs wine st=a

396474

conj and

396475

art the

396476

שֶּׁ֔מֶן

subs oil st=a

396477

conj and

396478

art the

396479

לְּבֹונָ֖ה

subs incense st=a

396480

conj and

396481

art the

396482

בְּשָׂמִֽים׃

subs balsam-tree st=a

Result 22

2_Kings 6:17

766610

phrase 766610 Objc NP

198502

סוּסִ֥ים

subs horse st=a

198503

conj and

198504

רֶ֛כֶב

subs chariot st=c

198505

אֵ֖שׁ

subs fire st=a

Result 23

1_Chronicles 28:13

894166

phrase 894166 PreC PP

405834

prep to

405835

מַחְלְקֹות֙

subs division st=c

405836

art the

405837

כֹּהֲנִ֣ים

subs priest st=a

405838

conj and

405839

art the

405840

לְוִיִּ֔ם

subs Levite st=a

405841

וּֽ

conj and

405842

prep to

405843

subs whole st=c

405844

מְלֶ֖אכֶת

subs work st=c

405845

עֲבֹודַ֣ת

subs work st=c

405846

subs house st=c

405847

יְהוָ֑ה

nmpr YHWH st=a

405848

וּֽ

conj and

405849

prep to

405850

subs whole st=c

405851

כְּלֵ֖י

subs tool st=c

405852

עֲבֹודַ֥ת

subs work st=c

405853

subs house st=c

405854

יְהוָֽה׃

nmpr YHWH st=a

Result 24

Numbers 3:8

692856

phrase 692856 Objc PP

71029

prep <object marker>

71030

subs whole st=c

71031

כְּלֵי֙

subs tool st=c

71032

אֹ֣הֶל

subs tent st=c

71033

מֹועֵ֔ד

subs appointment st=a

71034

conj and

71035

prep <object marker>

71036

מִשְׁמֶ֖רֶת

subs guard-post st=c

71037

בְּנֵ֣י

subs son st=c

71038

nmpr Israel st=a

Result 25

2_Samuel 17:14

750120

phrase 750120 Objc PP

170631

prep <object marker>

170632

עֲצַ֤ת

subs counsel st=c

170633

אֲחִיתֹ֨פֶל֙

nmpr Ahithophel st=a

170634

art the

170635

טֹּובָ֔ה

adjv good st=a

				...RESULTS CUT OFF AT 25...

For all of these cases, we add the second substantive (pink) into the dwords set:

In [15]:

dwordsadded = 0
for res in missing_atr_rec:
    dword = res[3] if len(res) == 4 else res[4]
    dwords.add(dword)
    dwordsadded += 1
print(f'{dwordsadded} words added to dwords...')

78 words added to dwords...

Jer 9:23, missed משפת¶

There is one case where משפת occurs together with חסד in Jer 9:23. Because it occurs in no other subphrase, and is contained only with חסד, this word has incorrectly been marked above as a dword. This happens because this word's subphrase pattern is unique in the BHSA dataset. We thus make an exclusion here and remove it. However, to do so requires careful selection of the word node to avoid problems when the data changes. Thus, we apply a somewhat lengthy search template that isolates this and only this word to remove from dword.

In [16]:

fix_mishpat = A.search('''

book book@en=Jeremiah
    chapter chapter=9
        verse verse=23
            word lex=MCPV/
''')

missed_mishpat = fix_mishpat[0][-1]

A.prettyTuple((missed_mishpat,), seqNumber=0)

  0.44s 1 result

Result 0

Jeremiah 9:23

792539

phrase 792539 Objc NP

239841

חֶ֛סֶד

subs loyalty st=a

239842

מִשְׁפָּ֥ט

subs justice st=a

239843

conj and

239844

צְדָקָ֖ה

subs justice st=a

In [17]:

missed_mishpat in dwords

Out[17]:

True

In [18]:

# remove it from dword

dwords.remove(missed_mishpat)
missed_mishpat in dwords

Out[18]:

False

Missing בתר in Gen 15:10¶

Another accidental addition dwords is this case in Gen 15:10 where the substantive בתר is modified by אישׁ. This case is unique because the two are separated with a maqef, but the first item modifies the second (rather than the other way around). We remove בתר from dwords.

In [19]:

fix_btr = A.search('''

book book@en=Genesis
    chapter chapter=15
        verse verse=10
            word lex=BTR/

''')

missed_btr = fix_btr[0][-1]

A.prettyTuple((missed_btr,), seqNumber=0)

  0.45s 1 result

Result 0

Genesis 15:10

655306

phrase 655306 Objc NP

6839

אִישׁ־

subs man st=a

6840

בִּתְרֹ֖ו

subs piece st=a

In [20]:

dwords.remove(missed_btr)
missed_btr in dwords

Out[20]:

False

Coordinations with Modifying Term¶

There remain multiple cases where the modifying words selected above have a coordinate word. We isolate those cases below and add them to dwords.

In [21]:

par_dwords = A.search('''

phrase
    s1:subphrase
    /without/
        quant
    /-/
        =: dword
    s2:subphrase rela=par
        word pdp=subs|nmpr|adjv
        /without/
        subphrase rela=NA
            ..
        /-/
s1 <mother- s2

''', sets={'dword':dwords, 'quant': quantifiers}) + A.search('''

phrase
    s1:subphrase
    /without/
        quant
    /-/
        := dword
    s2:subphrase rela=par
        word pdp=subs|nmpr|adjv
        /without/
        subphrase rela=NA
            ..
        /-/
s1 <mother- s2

''', sets={'dword':dwords, 'quant': quantifiers})


new_par_dwords = set()

for res in par_dwords:    
    newdword = res[4]
    new_par_dwords.add(newdword)
    dwords.add(newdword)
    
print(f'{len(new_par_dwords)} new dwords added to dword set...')

  1.10s 5 results
  0.90s 12 results
12 new dwords added to dword set...

Missing Quantifier Relations¶

During the testing and development of these templates, a number of subphrase mistakes were found in the data. These primarily consist of missing relations between quantified elements and the cardinal number, as seen below. The code displays the phrases in question, their selected heads, and the subphrase relations found within them. The lack of a subphrase containing the number, or the lack of a relation from the number to the quantified has resulted in poorly-selected heads. Typically these relations are expressed with subphrase relations of rec or adj.

In [22]:

phrases_to_patch = []

# book, chapter, verse, clause_atom number, phrase number
missing_quant_relas = [('Daniel', 3, 23, 444, 2),
                       ('Daniel', 9, 25, 1423, 2),
                       ('Ezra', 1, 9, 39, 1),
                       ('Ezra', 1, 10, 42, 1),
                       ('Ezra', 8, 20, 621, 3),
                       ('1_Chronicles', 12, 29, 1071, 3)]

for book, chapter, verse, clat_nu, phrase_nu  in missing_quant_relas:
    findit = f'''    
    book book@en={book}
        chapter chapter={chapter}
            verse verse={verse}
                clause_atom number={clat_nu}
                    phrase number={phrase_nu}
    '''
    phrase = A.search(textwrap.dedent(findit))[0][4]
    
    A.prettyTuple((phrase,), seqNumber=0)
    print('subphrase relations:')
    show_subphrases(phrase)
        
    phrases_to_patch.append(phrase)

  0.49s 1 result

Result 0

Daniel 3:23

877452

phrase 877452 Subj NP

372116

גֻבְרַיָּ֤א

subs man st=e

372117

אִלֵּךְ֙

prde these

372118

תְּלָ֣תֵּהֹ֔ון

subs three st=a

phrase 877452 Subj NP|PrNP

372119

שַׁדְרַ֥ךְ

nmpr Shadrach st=a

372120

מֵישַׁ֖ךְ

nmpr Meshach st=a

372121

conj and

372122

עֲבֵ֣ד נְגֹ֑ו

nmpr Abed-Nego st=a

subphrase relations:
-------1395312----------------

גֻבְרַיָּ֤א  -NA-> 
nodes:  1395312 -NA-> 
slots:  [372116] -NA-> ()
------------------------------
-------1395313----------------

אִלֵּךְ֙  -dem-> גֻבְרַיָּ֤א 
nodes:  1395313 -dem-> 1395312
slots:  [372117] -dem-> [372116]
------------------------------
-------1395314----------------

שַׁדְרַ֥ךְ  -NA-> 
nodes:  1395314 -NA-> 
slots:  [372119] -NA-> ()
------------------------------
-------1395315----------------

מֵישַׁ֖ךְ  -par-> שַׁדְרַ֥ךְ 
nodes:  1395315 -par-> 1395314
slots:  [372120] -par-> [372119]
------------------------------
-------1395316----------------

מֵישַׁ֖ךְ  -NA-> 
nodes:  1395316 -NA-> 
slots:  [372120] -NA-> ()
------------------------------
-------1395317----------------

עֲבֵ֣ד נְגֹ֑ו  -par-> מֵישַׁ֖ךְ 
nodes:  1395317 -par-> 1395316
slots:  [372122] -par-> [372120]
------------------------------
  0.44s 1 result

Result 0

Daniel 9:25

880173

phrase 880173 Time NP

376359

שָׁבֻעִ֞ים

subs week st=a

376360

שִׁשִּׁ֣ים

subs six st=a

376361

conj and

376362

שְׁנַ֗יִם

subs two st=a

subphrase relations:
-------1396382----------------

שָׁבֻעִ֞ים שִׁשִּׁ֣ים  -NA-> 
nodes:  1396382 -NA-> 
slots:  [376359, 376360] -NA-> ()
------------------------------
-------1396380----------------

שָׁבֻעִ֞ים  -NA-> 
nodes:  1396380 -NA-> 
slots:  [376359] -NA-> ()
------------------------------
-------1396381----------------

שִׁשִּׁ֣ים  -par-> שָׁבֻעִ֞ים 
nodes:  1396381 -par-> 1396380
slots:  [376360] -par-> [376359]
------------------------------
-------1396383----------------

שְׁנַ֗יִם  -par-> שָׁבֻעִ֞ים שִׁשִּׁ֣ים 
nodes:  1396383 -par-> 1396382
slots:  [376362] -par-> [376359, 376360]
------------------------------
  0.47s 1 result

Result 0

Ezra 1:9

881401

phrase 881401 PreC NP

378379

מַחֲלָפִ֖ים

subs disease st=a

378380

תִּשְׁעָ֥ה

subs nine st=a

378381

conj and

378382

עֶשְׂרִֽים׃ ס

subs twenty st=a

subphrase relations:
-------1396930----------------

מַחֲלָפִ֖ים תִּשְׁעָ֥ה  -NA-> 
nodes:  1396930 -NA-> 
slots:  [378379, 378380] -NA-> ()
------------------------------
-------1396928----------------

מַחֲלָפִ֖ים  -NA-> 
nodes:  1396928 -NA-> 
slots:  [378379] -NA-> ()
------------------------------
-------1396929----------------

תִּשְׁעָ֥ה  -par-> מַחֲלָפִ֖ים 
nodes:  1396929 -par-> 1396928
slots:  [378380] -par-> [378379]
------------------------------
-------1396931----------------

עֶשְׂרִֽים׃ ס  -par-> מַחֲלָפִ֖ים תִּשְׁעָ֥ה 
nodes:  1396931 -par-> 1396930
slots:  [378382] -par-> [378379, 378380]
------------------------------
  0.45s 1 result

Result 0

Ezra 1:10

881404

phrase 881404 PreC NP

378393

כֵּלִ֥ים

subs tool st=a

378394

אֲחֵרִ֖ים

adjv other st=a

378395

אָֽלֶף׃ ס

subs thousand st=a

subphrase relations:
-------1396946----------------

כֵּלִ֥ים  -NA-> 
nodes:  1396946 -NA-> 
slots:  [378393] -NA-> ()
------------------------------
-------1396947----------------

אֲחֵרִ֖ים  -atr-> כֵּלִ֥ים 
nodes:  1396947 -atr-> 1396946
slots:  [378394] -atr-> [378393]
------------------------------
  0.43s 1 result

Result 0

Ezra 8:20

882985

phrase 882985 Objc NP

381870

נְתִינִ֖ים

subs temple slave st=a

381871

מָאתַ֣יִם

subs hundred st=a

381872

conj and

381873

עֶשְׂרִ֑ים

subs twenty st=a

subphrase relations:
-------1398630----------------

נְתִינִ֖ים  -NA-> 
nodes:  1398630 -NA-> 
slots:  [381870] -NA-> ()
------------------------------
-------1398631----------------

מָאתַ֣יִם  -par-> נְתִינִ֖ים 
nodes:  1398631 -par-> 1398630
slots:  [381871] -par-> [381870]
------------------------------
-------1398632----------------

מָאתַ֣יִם  -NA-> 
nodes:  1398632 -NA-> 
slots:  [381871] -NA-> ()
------------------------------
-------1398633----------------

עֶשְׂרִ֑ים  -par-> מָאתַ֣יִם 
nodes:  1398633 -par-> 1398632
slots:  [381873] -par-> [381871]
------------------------------
  0.42s 1 result

Result 0

1_Chronicles 12:29

890391

phrase 890391 PreC NP

398214

שָׂרִ֖ים

subs chief st=a

398215

subs twenty st=a

398216

conj and

398217

שְׁנָֽיִם׃ ס

subs two st=a

subphrase relations:
-------1405362----------------

שָׂרִ֖ים עֶשְׂרִ֥ים  -NA-> 
nodes:  1405362 -NA-> 
slots:  [398214, 398215] -NA-> ()
------------------------------
-------1405360----------------

שָׂרִ֖ים  -NA-> 
nodes:  1405360 -NA-> 
slots:  [398214] -NA-> ()
------------------------------
-------1405361----------------

עֶשְׂרִ֥ים  -par-> שָׂרִ֖ים 
nodes:  1405361 -par-> 1405360
slots:  [398215] -par-> [398214]
------------------------------
-------1405363----------------

שְׁנָֽיִם׃ ס  -par-> שָׂרִ֖ים עֶשְׂרִ֥ים 
nodes:  1405363 -par-> 1405362
slots:  [398217] -par-> [398214, 398215]
------------------------------

Perhaps it is significant that all of these examples come from the same cluster of books: Daniel, Ezra, and 1 Chronicles. This may indicate that the individual who encoded these texts did not understand the standard for relations of quantification in the ETCBC database. These are all cases that a second BHSA should address. For now, it is fair to correct them manually by removing all quantifiers selected as heads from these phrases.

The template below is tuned to pick out these examples: primarily they are cases where a phrase atom contains a quantified substantive, and this substantive has no dependent subphrase relations. Two additional checks are made with /or/ to cover the peculiar cases of Ezra 1:20 (כלים אחרים אלף, i.e. adjv intervenes between quantifier) and Daniel 3:23 (גבריא אלך תלתהן, i.e. where a demonstrative intervenes).

In [23]:

non_rela_cardinals = A.search('''
phrase
    phrase_atom
        
        w1:word ls=card
        
        /without/
        subphrase rela=atr|adj|rec
            ..
        /-/
        /with/
        phrase_atom
            nonquantprep pdp=subs|nmpr
            <: word ls=card prs=absent
            < w1 prs=absent
        /or/
        phrase_atom
            nonquantprep pdp=subs|nmpr
            <: word pdp=prde
            <: w1
        /or/
        phrase_atom
            nonquantprep pdp=subs|nmpr
            <: word pdp=adjv
            < w1 prs=absent
        /-/
        
''', sets=sets)

  1.72s 31 results

In [24]:

A.show([res for res in non_rela_cardinals if res[0] in phrases_to_patch])

phrase 1

Daniel 3:23

877452

phrase 877452 Subj NP

372116

גֻבְרַיָּ֤א

subs man st=e

372117

אִלֵּךְ֙

prde these

372118

תְּלָ֣תֵּהֹ֔ון

subs three st=a

phrase 877452 Subj NP|PrNP

372119

שַׁדְרַ֥ךְ

nmpr Shadrach st=a

372120

מֵישַׁ֖ךְ

nmpr Meshach st=a

372121

conj and

372122

עֲבֵ֣ד נְגֹ֑ו

nmpr Abed-Nego st=a

phrase 2

Daniel 9:25

880173

phrase 880173 Time NP

376359

שָׁבֻעִ֞ים

subs week st=a

376360

שִׁשִּׁ֣ים

subs six st=a

376361

conj and

376362

שְׁנַ֗יִם

subs two st=a

phrase 3

Ezra 1:9

881401

phrase 881401 PreC NP

378379

מַחֲלָפִ֖ים

subs disease st=a

378380

תִּשְׁעָ֥ה

subs nine st=a

378381

conj and

378382

עֶשְׂרִֽים׃ ס

subs twenty st=a

phrase 4

Ezra 1:10

881404

phrase 881404 PreC NP

378393

כֵּלִ֥ים

subs tool st=a

378394

אֲחֵרִ֖ים

adjv other st=a

378395

אָֽלֶף׃ ס

subs thousand st=a

phrase 5

Ezra 8:20

882985

phrase 882985 Objc NP

381870

נְתִינִ֖ים

subs temple slave st=a

381871

מָאתַ֣יִם

subs hundred st=a

381872

conj and

381873

עֶשְׂרִ֑ים

subs twenty st=a

phrase 6

1_Chronicles 12:29

890391

phrase 890391 PreC NP

398214

שָׂרִ֖ים

subs chief st=a

398215

subs twenty st=a

398216

conj and

398217

שְׁנָֽיִם׃ ס

subs two st=a

Below we check to see how many of our cases are covered by these criteria.

In [25]:

accounted = set(phrases_to_patch) & set(res[0] for res in non_rela_cardinals)
len(accounted)

Out[25]:

This covers all the missed quantifiers above as well as a few extra that I have manually inspected to ensure none are good heads.

In [26]:

dwordsadded = 0
for res in non_rela_cardinals:
    dword = res[2]
    dwords.add(dword)
    dwordsadded += 1
print(f'{dwordsadded} words added to dwords...')

31 words added to dwords...

Incorrect Relation Assignment¶

There are a handful of cases where the ETCBC data has a relation that points at the wrong object. The few cases below are those which could not be fixed programmatically due to the complexity of the problem.

Incorrect `par` Relations¶

In [27]:

bad_pars = []

bad_par1 = '''

book book@en=Jeremiah
    chapter chapter=32
        verse verse=32
            phrase
                word lex=BN/
                <: word lex=JHWDH/
'''
badpar1_note = 'בני־יהודה should be parallel to בני־ישראל rather than רעת בני־ישראל'
bad_pars.append({'template':bad_par1, 'phrasei':3, 'badi':4, 'note':badpar1_note})

bad_par2 = '''

book book@en=Jeremiah
    chapter chapter=40
        verse verse=1
            phrase
                word lex=JHWDH/
'''
badpar2_note = 'יהודה should be parallel to ירושלים rather than גלות־ירושלים'
bad_pars.append({'template':bad_par2, 'phrasei':3, 'badi':4, 'note':badpar2_note})

In [28]:

bad_par_dwords = set()

for i, bp in enumerate(bad_pars):
    bp_res = A.search(bp['template'], silent=False)
    phrase = bp_res[0][bp['phrasei']]
    bad = bp_res[0][bp['badi']]
    bad_par_dwords.add(bad)
    
    print()
    A.prettyTuple((phrase, bad), seqNumber=bp['note'])
    print(f'subphrases containing slot {bad}')
    show_subphrases(bad, direction=L.u)

  1.06s 1 result

Result בני־יהודה should be parallel to בני־ישראל rather than רעת בני־ישראל

Jeremiah 32:32

799762

phrase 799762 Adju PP

252219

עַל֩

prep upon

252220

subs whole st=c

252221

רָעַ֨ת

subs evil st=c

252222

subs son st=c

252223

יִשְׂרָאֵ֜ל

nmpr Israel st=a

252224

conj and

252225

בְנֵ֣י

subs son st=c

252226

יְהוּדָ֗ה

nmpr Judah st=a

subphrases containing slot 252225
-------1366882----------------

בְנֵ֣י  -NA-> 
nodes:  1366882 -NA-> 
slots:  [252225] -NA-> ()
------------------------------
-------1366884----------------

בְנֵ֣י יְהוּדָ֗ה  -par-> רָעַ֨ת בְּנֵֽי־יִשְׂרָאֵ֜ל 
nodes:  1366884 -par-> 1366881
slots:  [252225, 252226] -par-> [252221, 252222, 252223]
------------------------------
-------1366885----------------

רָעַ֨ת בְּנֵֽי־יִשְׂרָאֵ֜ל וּבְנֵ֣י יְהוּדָ֗ה  -rec-> כָּל־
nodes:  1366885 -rec-> 252220
slots:  [252221, 252222, 252223, 252224, 252225, 252226] -rec-> ()
------------------------------
  0.66s 1 result

Result יהודה should be parallel to ירושלים rather than גלות־ירושלים

Jeremiah 40:1

802195

phrase 802195 Adju PP

256807

prep in

256808

תֹ֨וךְ

subs midst st=c

256809

subs whole st=c

256810

גָּל֤וּת

subs exile st=c

256811

יְרוּשָׁלִַ֨ם֙

nmpr Jerusalem st=a

256812

conj and

256813

יהוּדָ֔ה

nmpr Judah st=a

subphrases containing slot 256813
-------1368189----------------

יהוּדָ֔ה  -par-> גָּל֤וּת יְרוּשָׁלִַ֨ם֙ 
nodes:  1368189 -par-> 1368188
slots:  [256813] -par-> [256810, 256811]
------------------------------
-------1368190----------------

גָּל֤וּת יְרוּשָׁלִַ֨ם֙ וִֽיהוּדָ֔ה  -rec-> כָּל־
nodes:  1368190 -rec-> 256809
slots:  [256810, 256811, 256812, 256813] -rec-> ()
------------------------------
-------1368191----------------

כָּל־גָּל֤וּת יְרוּשָׁלִַ֨ם֙ וִֽיהוּדָ֔ה  -rec-> תֹ֨וךְ 
nodes:  1368191 -rec-> 256808
slots:  [256809, 256810, 256811, 256812, 256813] -rec-> ()
------------------------------

As can be seen in the subphrase printouts, these parallel relations do not point to the best subphrase. These problems are fixed below by adding the paralleled terms to dwords.

In [29]:

dwords |= bad_par_dwords
list(bad_par_dwords)[0] in dwords # sanity check

Out[29]:

True

Completing `iword`¶

Below dword is finalized and used to create the iword set.

In [30]:

iwords = set(w for w in F.otype.s('word') if w not in dwords)
sets['iword'] = iwords
sets['dword'] = dwords

Selecting Heads¶

Now that the pre-processing procedures are complete, they can be applied to the search templates to find the phrase heads. I will follow a process of deduction for assigning heads to phrases. So, we first select all phrases, and then track which heads are accounted for.

In [31]:

remaining_phrases = set(result[0] for result in A.search('phrase')) # get all phrases
covered_phrases = set() # put covered phrases here
remaining_types = list(feat[0] for feat in F.typ.freqList(nodeTypes='phrase')) # track and elminate phrase types

  0.26s 253207 results

All phrase to head assignments will be made in the dictionary below:

In [32]:

phrase2heads = collections.defaultdict(set)

In order to assist the process of elimination, the functions below programmatically record the heads in phrase2heads and remove them from the remaining set. query_heads iterates through a dictionary of queries and calls record_head on each result. heads_status provides a simple readout of what phrases remain to be analysed.

In [33]:

def record_head(phrase, head, mapping=phrase2heads, remaining=remaining_phrases, covered=covered_phrases):
    '''
    Simple function to track phrases
    with heads that are accounted for
    and to modify the phrase2heads
    dict, which is a mapping from a phrase
    node to its head nodes.
    '''
    # try/except accounts for phrases with plural heads, 
    # one of which is already recorded
    try:
          remaining.remove(phrase)
    except: 
        pass
    
    if F.otype.v(phrase) == 'word':
        raise Exception(f'node {phrase} is a word not a phrase!')
    
    mapping[phrase].add(head) # record it
    covered.add(phrase)
    
def query_heads(querydict, phrasei=0, headi=1, sets={}):
    '''
    Runs queries on phrasetype/query dict.
    Reports results.
    Adds results.
    
    phrasei - the index of the phrase result in the search template.
    headi - the index of the head result in the search template.
    sets - custom sets for TF search
    '''
    for phrasetype, query in querydict.items():
        print(f'running query on {phrasetype}')
        results = A.search(query, silent=True, sets=sets)
        print(f'\t{len(results)} results found')
        for res in results:
            phrase, head = res[phrasei], res[headi]
            record_head(phrase, head)
            
def heads_status():
    # simply prints accounted vs unaccounted heads
    print(f'{len(covered_phrases)} phrases matched with a head...')
    print(f'{len(remaining_phrases)} phrases remaining...')

Simple Heads¶

The selection of heads for certain phrase types is very straightforward. Those are defined in the templates below and are subsequently applied. These phrase types are selected based on the survey of their subphrase relations as found in the old notebook.

In [34]:

simp_heads = dict(

PPrP = '''
% personal pronoun

phrase typ=PPrP
    iphrase_atom
        iword pdp=prps

''',

DPrP = '''
% demonstrative pronoun

phrase typ=DPrP
    iphrase_atom
        iword pdp=prde

''',

InjP = '''
% interjectional

phrase typ=InjP
    iphrase_atom
        iword pdp=intj

''',

NegP = '''
% negative

phrase typ=NegP
    iphrase_atom
        iword pdp=nega

''',

InrP = '''
% interrogative

phrase typ=InrP
    iphrase_atom
        iword pdp=inrg

''',
    
IPrP = '''
% interrogative pronoun

phrase typ=IPrP
    iphrase_atom
        iword pdp=prin

''',

) # end of dictionary

Make Queries, Record Heads, See What Remains¶

Here we run the queries and run record_head over each result. In all of the templates the head is the second item in the result tuple.

In [35]:

query_heads(simp_heads, headi=2, sets=sets)
        
print('\n', '<>'*20, '\n')
heads_status()

running query on PPrP
	4386 results found
running query on DPrP
	790 results found
running query on InjP
	1883 results found
running query on NegP
	6742 results found
running query on InrP
	1291 results found
running query on IPrP
	798 results found

 <><><><><><><><><><><><><><><><><><><><> 

15866 phrases matched with a head...
237341 phrases remaining...

Find Remaining Phrases¶

What phrases with the above types remain unaccounted for?

In [36]:

unaccounted_simp = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) in simp_heads)
len(unaccounted_simp)

Out[36]:

In [37]:

for typ in simp_heads:
    remaining_types.remove(typ)
print(remaining_types)

['VP', 'PP', 'CP', 'NP', 'PrNP', 'AdvP', 'AdjP']

Mostly Simple Heads¶

The next set of heads require a bit more care since they can contain a bigger variety of relationships.

VP¶

There is only one complication for the VP: that is that there is one VP that has more than one verb:

In [38]:

mult_verbs = A.search('''

phrase typ=VP
/with/
    word pdp=verb
    < word pdp=verb
/-/
''')
A.show(mult_verbs, condenseType='clause')

  1.29s 1 result

phrase 1

1_Chronicles 24:6

511867

clause 511867 Ptcp

phrase 893309 Conj CP

403601

conj and

phrase 893310 PreC VP

403602

אָחֻ֥ז׀

verb seize qal ptcp st=a

403603

אָחֻ֖ז

verb seize qal ptcp st=a

phrase 893311 Adju PP

403604

prep to

403605

אִיתָמָֽר׃ פ

nmpr Ithamar st=a

The template below excludes this case without ignoring VPs that do not necessarily begin with a verb.

In [39]:

VP = '''

phrase typ=VP
    
    head:iword pdp=verb
    
    /without/
    phrase
        word pdp=verb
        < head
    /-/

'''

VP_search = A.search(VP, sets=sets)

for phrase, head in VP_search:
    record_head(phrase, head)
    
heads_status()

  1.62s 69024 results
84890 phrases matched with a head...
168317 phrases remaining...

`VP` Sanity Check¶

We double check that the indicated phrase above only has one head.

In [40]:

phrase2heads[893310]

Out[40]:

{403602}

See what's left...

In [41]:

unaccounted_vp = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) == 'VP')
len(unaccounted_vp)

Out[41]:

In [42]:

remaining_types.remove('VP')
print(remaining_types)

['PP', 'CP', 'NP', 'PrNP', 'AdvP', 'AdjP']

CP¶

The conjunction phrase is relatively straightforward. But there are 1140 cases where the conjunction is technically headed by a preposition in the ETCBC data. These are phrases such as בטרם and בעבור (see the more detailed analysis in the prev. notebook). It is not clear at all why the ETCBC encodes these as conjunction phrases. This is almost certainly a confusion of the formal typ value and the functional function label (with a value of Conj). Nevertheless, here we make a choice to select the preposition as the true head.

In a second BHSA, these cases ought to be repaired.

In [43]:

cp_heads = dict(

conj = '''

phrase typ=CP
/without/
    word pdp=prep
/-/
    iphrase_atom
        head:iword pdp=conj
        /without/
        phrase_atom
            word pdp=conj
            <: head
        /-/

''',
    
prep_conj = '''

phrase typ=CP
    iphrase_atom
        =: word pdp=prep

'''

)

In [44]:

query_heads(cp_heads, headi=2, sets=sets)
        
print('\n', '<>'*20, '\n')
heads_status()

running query on conj
	51341 results found
running query on prep_conj
	1140 results found

 <><><><><><><><><><><><><><><><><><><><> 

137371 phrases matched with a head...
115836 phrases remaining...

`CP` Sanity Check¶

In [45]:

unaccounted_cp = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) == 'CP')
len(unaccounted_cp)

Out[45]:

In [46]:

remaining_types.remove('CP')
print(remaining_types)

['PP', 'NP', 'PrNP', 'AdvP', 'AdjP']

AdjP¶

The adjective phrase always occurs with a word that has a pdp of adjective:

In [47]:

A.search('''

phrase typ=AdjP
/without/
    word pdp=adjv
/-/

''')

  0.65s 0 results

Out[47]:

[]

By playing with the head:word pdp= value below, I ascertain that there are 8 uses of subs as a head in this phrase type, and 1 use of advb as a head. These variants are due to the phrase containing multiple heads, with the first having a pdp of adjv, formally making the phrase an AdjP.

The selection criteria is as follows. We want all cases in an adjective phrase where the word has a pdp of adjv, subs, or advb. The head candidate must not be found in a modifying subphrase, defined as rela=adj|atr|rec|mod|dem (remember that a word can often occur in multiple subphrases); and the only acceptable values for phrase_atom and subphrase relations are either NA (no relation), or Para/par (coordinate relation). In this latter case, it is expected here that the first requirement will prevent spurious parallel results (that is, words that are parallel not to a head but to a modifying element).

The requirements are set in the pattern below.

In [48]:

AdjP = '''

phrase typ=AdjP
    iphrase_atom
        head:iword pdp=adjv|subs|advb
        
% require either NA subphrase relation
% or no subphrase embedding:
        /with/
        subphrase rela=NA|par
            head
        /or/
        /without/
        subphrase
            head
        /-/
        /-/
        
% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            head
        /-/
        
% ensure word is not immediately preceded by a construct form
        /without/
        phrase_atom
            word st=c
            <: head
        /-/
'''
AdjP = A.search(AdjP, sets=sets)

for res in AdjP:
    phrase, head = res[0], res[2]
    record_head(phrase, head)
    
heads_status()

  1.33s 1871 results
139120 phrases matched with a head...
114087 phrases remaining...

In [49]:

unaccounted_AdjP = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) == 'AdjP')
len(unaccounted_AdjP)

Out[49]:

In [50]:

remaining_types.remove('AdjP')
print(remaining_types)

['PP', 'NP', 'PrNP', 'AdvP']

AdvP¶

The adverb phrase has similar internal relations to AdjP. Thus, we apply the same basic template search.

By modifying pdp= parameter, I have found 2 examples of a preposition in the AdvP, which is caused by a prepositional phrase_atom coordinated with the AdvP phrase_atom. These mixed cases must be dealt with imperfectly by taking the preposition head literally. It is then up to the user of the heads feature to include/exclude cases such as these, or to depend on the phrase atoms.

There are two cases where an inrg serves as a head element. These are incorrect encodings, as they belong under their own phrase type of InrP. These should be fixed in a second BHSA. For now the inrg is excluded as a phrase head as they are followed by a advb which is probably triggering these phrases' classification.

There is one case in sentence 68 from Exodus 8:20 where a כל quantifier is incorrectly identified as a phrase head. This is because it precedes a prepositional element. That case is also excluded below.

In [51]:

AdvP = '''

phrase typ=AdvP
    iphrase_atom
        head:iword pdp=advb|subs|nmpr|prep
        
% require either NA subphrase relation
% or no subphrase embedding:
        /with/
        subphrase rela=NA|par
            head
        /or/
        /without/
        subphrase
            head
        /-/
        /-/
        
% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            head
        /-/
        
% ensure word is not immediately preceded by a construct form
        /without/
        phrase_atom
            word st=c
            <: head
        /-/
        
% ensure word is not immediately preceded by a prepositional form
        /without/
        phrase_atom
            word pdp=prep
            <: head
        /-/
'''
AdvP = A.search(AdvP, sets=sets)

for res in AdvP:
    phrase, head = res[0], res[2]
    record_head(phrase, head)
    
heads_status()

  1.78s 5776 results
144779 phrases matched with a head...
108428 phrases remaining...

In [52]:

unaccounted_AdvP = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) == 'AdvP')
len(unaccounted_AdvP)

Out[52]:

In [53]:

remaining_types.remove('AdvP')
print(remaining_types)

['PP', 'NP', 'PrNP']

PP¶

The same method used above applies to prepositional phrases.

In [54]:

PP = '''

phrase typ=PP
    iphrase_atom
        head:iword pdp=prep
        
% require either NA subphrase relation
% or no subphrase embedding:
        /with/
        subphrase rela=NA|par
            head
        /or/
        /without/
        subphrase
            head
        /-/
        /-/
        
% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            head
        /-/
        
% ensure word is not immediately preceded by a construct form
        /without/
        phrase_atom
            word st=c
            <: head
        /-/
        
% ensure word is not immediately preceded by a preposition
        /without/
        phrase_atom
            word pdp=prep
            <: head
        /-/
'''
PP = A.search(PP, sets=sets)

for res in PP:
    phrase, head = res[0], res[2]
    record_head(phrase, head)
    
heads_status()

  1.60s 61498 results
202261 phrases matched with a head...
50946 phrases remaining...

In [55]:

unaccounted_PP = set(phrase for phrase in remaining_phrases
                          if F.typ.v(phrase) == 'PP')
len(unaccounted_PP)

Out[55]:

In [56]:

remaining_types.remove('PP')
print(remaining_types)

['NP', 'PrNP']

Complex Heads¶

In contrast to the preceding phrase types, the noun phrase is much more complicated for head selection due to the presence of quantifiers. The search templates are thus quite lengthy. Each one has been rigorously tested, and each change has been run against a previous version of the template to ensure that any edits did not accidentally shorten or expand the search results beyond the desired effect.

`NP` and `PrNP`¶

Note that some noun phrases contain other phrase types, such as PP or even AdjP that are not indicated in the present implementation of the data. A second BHSA should seek to remedy this by spinning new phrases with their own types for these.

In [57]:

NP_heads = dict(
    
NP_noqant = f'''

phrase typ=NP|PrNP|DPrP|PPrP
    iphrase_atom
        head:nonquant pdp=subs|adjv|nmpr|prde|prps

% require either NA subphrase relation
% or no subphrase embedding:
        /with/
        subphrase rela=NA|par
            head
        /or/
        /without/
        subphrase
            head
        /-/
        /-/
        /with/
        = iword
        /-/
        
% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            head
        /-/
        
% ensure word is not immediately preceded by a construct form
        /without/
        phrase_atom
            word st=c
            <: head
        /-/
        
% ensure word is not immediately preceded by a verb (participle) + preposition
        /without/
        phrase_atom
            word sp=verb
            <: prep
            <: head
        /-/
        /without/
        phrase_atom
            word sp=verb
            <: prep
            <: word pdp=art
            <: head
        /-/
''',

NP_quant_alone = f'''

phrase typ=NP|PrNP|DPrP|PPrP
    iphrase_atom
        quantifier:quant

% quantifier does not precede a quantified element within a subphrase
        /without/
        subphrase
            quantifier
            < w1:nonquantprep pdp=adjv|subs|nmpr|prde|prps
            /with/
            = nonpostprep
            /-/
        /-/ 

% quantifier not immediately adjacent to quantified element within a phrase_atom
        /without/
        phrase_atom
            quantifier
            <: w1:nonquantprep pdp=subs|nmpr|prde|prps
        /-/
        /without/
        phrase_atom
            quantifier
            <: word pdp=art
            <: w1:nonquantprep pdp=subs|nmpr|prde|prps
        /-/
        /without/
        phrase_atom
            w1:nonquantprep pdp=subs|nmpr|prde|prps
            <: quantifier
        /-/
        /without/
        phrase_atom
            w1:nonquantprep pdp=subs|nmpr|prde|prps
            <: word pdp=art
            <: quantifier
        /-/
        
        
% quantifier is not construct with quantified element
        /without/
        quantifier
        <mother- subphrase rela=rec
            nonquantprep pdp=subs|nmpr|prde|prps
        /-/
        /without/
        phrase_atom
            quantifier st=c
            <: nonquantprep pdp=subs|nmpr|prde|prps
        /-/
    
% quantifier is not in another relation with a quantified element
        /without/
        s1:subphrase
            quantifier
        s2:subphrase rela=adj|atr|dem
            w1:nonquantprep pdp=subs|nmpr|prde|prps
            /with/
            = nonpostprep
            /-/
% exclude cases where a prepositional object occurs non-adjacently
            /without/
            subphrase
            /without/
                quant
            /-/
                =: prep
                w1
            /-/
        s1 <mother- s2
        /-/

% ensure quantifer is not in a quantifying chain
% there are numerous possible relations
        /without/
        phrase
            phrase_atom rela=NA|Para
            /with/
                s1:subphrase
                    nonquantprep pdp=subs|nmpr|prde|prps
                s2:subphrase rela=adj|atr
                    word ls=card
                s1 <mother- s2
            /or/
                s1:subphrase
                    word ls=card
                s2:subphrase rela=adj|atr
                    nonquantprep pdp=subs|nmpr|prde|prps
                    /with/
                    = nonpostprep
                    /-/
                s1 <mother- s2
            /or/
                word ls=card
                <mother- subphrase rela=rec
                    nonquantprep pdp=subs|nmpr|prde|prps
            /or/
                nonquantprep pdp=subs|nmpr|prde|prps
                <mother- subphrase rela=rec
                    word ls=card
            /or/
                nonquantprep pdp=subs|nmpr|prde|prps st=c
                <: word ls=card
            /-/
% quantifier is either a cardinal number of BN/ in chain
            w1:word
            /with/
            ls=card prs=absent
            /or/
            lex=BN/ prs=absent
            /-/
            
            quantifier = w1
        /-/

% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            quantifier
        /-/
        /with/
        = iword
        /-/
''',)

NP_complex = dict(NP_quantified = '''

phrase typ=NP|PrNP|DPrP|PPrP
    iphrase_atom
   
% ensure that word is quantified with a head-word quantifier
% NB: what follows is a long chain of specs on quantifier

        quantifier:quant

% quantifier not used in rec relations to non-prepositions
        /without/
        nonprep
        <mother- subphrase rela=rec
            quantifier
            w1:word
            /without/
            phrase_atom
                prep
                <: w1
            /-/
            /without/
            phrase_atom
                prep
                <: word pdp=art
                <: w1
            /-/
            w1 = quantifier
        /-/

% quantifier not used in adj relations to non-quantifiers
        /without/
        subphrase
        /with/
            nonquant pdp#conj|art
        /-/
        <mother- subphrase rela=adj
            quantifier
        /-/

% ------------------------------
% NB: what follows is a long chain of specs on head

% require adjacency to quantifier
        <1: subphrase
            head:nonquant pdp=subs|adjv|advb|nmpr|prde|prps
    
% quantified word is not a dependent modifier
% exclude non-quant construct state
            /without/
            nonquant st=c
            <: head
            /-/
            /without/
            nonquant st=c
            <: word pdp=art
            <: head
            /-/

% exclude non-quant rec relas
            /without/
            nonquantprep
            <mother- subphrase rela=rec
                head
            /-/
    
% exclude non-quant para rec relas
            /without/
            nonquantprep
            <mother- subphrase rela=rec
            <mother- subphrase rela=par
                head
            /-/
        
% exclude non-quant adjunct relas
            /without/
            subphrase
            /without/
                := quant
            /-/
            <mother- subphrase rela=adj
                head 
            /-/
    
% exclude non-quant para adjunct relas
            /without/
            subphrase
            /without/
                := quant
            /-/
            <mother- subphrase rela=adj
            <mother- subphrase rela=par
                head
            /-/

% exclude demonstrative relas when demonstrative points to subphrase with words other than quantifiers
            /without/
            subphrase
            /with/
                nonquant pdp#art|conj
            /-/
            <mother- subphrase rela=dem
                head 
            /-/

% exclude all other kinds of relations
            /without/
            subphrase rela=atr|mod
                head
            /-/
            /with/
            = iword
            /or/
            quant
            <: ..
            /or/
            quant
            <: word pdp=art
            <: ..
            /or/
            ..
            <: quant
            /-/
            
% exclude words with immediately preceding prepositions
            /without/
            prep
            <: head
            /-/
            /without/
            prep
            <: word pdp=art
            <: head
            /-/
''',)

query_heads(NP_heads, headi=2, sets=sets)
query_heads(NP_complex, headi=4, sets=sets)
print('\n', '<>'*20, '\n')
heads_status()

running query on NP_noqant
	57114 results found
running query on NP_quant_alone
	1764 results found
running query on NP_quantified
	5557 results found

 <><><><><><><><><><><><><><><><><><><><> 

253205 phrases matched with a head...
2 phrases remaining...

There are only 2 phrases left. Let's have a look at the remaining phrases...

In [58]:

for phrase in list(remaining_phrases)[:100]:
    print(f'phrase {F.number.v(phrase)}')
    A.prettyTuple((phrase,), seqNumber=phrase)

phrase 4

Result 842047

Psalms 59:4

842047

phrase 842047 Adju NP

320066

nega not

320067

פִשְׁעִ֖י

subs rebellion st=a

320068

conj and

320069

nega not

320070

חַטָּאתִ֣י

subs sin st=a

phrase 2

Result 865764

Proverbs 23:29

865764

phrase 865764 Subj NP

353272

אֹ֥וי

intj woe

Here we have negatives and interjections serving as the phrase heads of noun phrases. That might be considered a mistake by the BHSA. These are good candidates for analysis in a second BHSA.

For now, for the two remaining phrases we apply a two part head assignment. First, if there is a substantive remaining in the unaccounted phrase, then we assign that to the position of head. Second, if there is no valid substantive, then we take the only valid word without a dependent subphrase relation. The selection is done in this way to account for other future versions of the dataset that may have more than just these two cases.

In [59]:

print(f'Assigning last ditch heads to {len(remaining_phrases)} remaining phrases...')
last_ditch_assignments = {}

for phrase in remaining_phrases:
    subs = set(w for w in L.d(phrase, 'word') if F.pdp.v(w) == 'subs')
    if subs:
        phrase2heads[phrase] |= subs
        last_ditch_assignments[phrase] = subs
    else:
        head_candidates = set(w for w in L.d(phrase, 'word')
                                  if any([not set(F.rela.v(sp) for sp in L.u(w, 'subphrase')) - {'Para', 'NA'}, # independent rela check
                                          not L.u(w, 'subphrase')])  # or no subphrase containment
                             )
        phrase2heads[phrase] |= head_candidates
        last_ditch_assignments[phrase] = head_candidates
        
remaining_phrases.difference_update(set(last_ditch_assignments.keys()))
        
print(f'{len(remaining_phrases)} phrases left without a head...')

Assigning last ditch heads to 2 remaining phrases...
0 phrases left without a head...

In [60]:

# sanity check for assignment
for phrase, heads in last_ditch_assignments.items():
    A.prettyTuple((phrase,)+tuple(heads), seqNumber=0)

Result 0

Psalms 59:4

842047

phrase 842047 Adju NP

320066

nega not

320067

פִשְׁעִ֖י

subs rebellion st=a

320068

conj and

320069

nega not

320070

חַטָּאתִ֣י

subs sin st=a

Result 0

Proverbs 23:29

865764

phrase 865764 Subj NP

353272

אֹ֥וי

intj woe

Evaluating Heads v.2¶

Vs. Stephen Ku's Evaluation of `headsv.1`¶

Stephen Ku has kindly and laboriously documented missing head cases for heads v.1 (from April 21, see doc here). Below, I display Stephen's feedback and check to make sure his examples are indeed cured.

In [61]:

ku_eval = pd.read_csv('stephen_ku_heads_eval.csv', header=1).fillna('')
ku_eval['Decision'] = ''
ku_eval

Out[61]:

	Verse	Issue	Cause
0	Gen 39:4	missing יֵשׁ after כֹּל	יֵשׁ is not in the same clause as כֹּל
1	Num 5:2	missing two כֹּל’s (the one before צָר֖וּעַ an...
2	Num 5:2	it’s able to find the last כֹ֖ל but not טָמֵא ...	טָמֵא belongs to another clause and is not mar...
3	Num 31:7	missing זָכָר after כֹּל	זָכָר is an adjective
4	Num 31:17	missing זָכָ֖ר after כֹּל	זָכָר is an adjective
5	Num 31:17	missing כֹּל before אִשָּׁה
6	Deut 25:18	missing חשׁל after כֹּל	נֶּחֱשָׁלִ֣ים belongs to another clause and is...
7	Judg 13:4	missing טָמֵֽא after כֹּל	טָמֵֽא is an adjective
8	1 Sam 14:36	missing טוֹב after כֹּל	טוֹב is an adjective
9	1 Kgs 11:15	missing זָכָר after כֹּל	זָכָר is an adjective
10	1 Kgs 11:16	missing זָכָר after כֹּל	זָכָר is an adjective
11	Isa 43:7	missing הַנִּקְרָ֣א after כֹּל	נִּקְרָ֣א belongs to another clause and is a verb
12	Ps 73:27	missing זֹונֶ֥ה after כֹּל	זֹונֶ֥ה belongs to another clause and is a verb
13	Ps 119:118	missing שֹׁוגִ֣ים after כֹּל	שֹׁוגִ֣ים belongs to another clause and is a verb
14	Job 40:11	missing גֵּ֝אֶ֗ה after כֹּל	גֵּ֝אֶ֗ה is an adjective
15	Job 40:12	missing גֵּ֝אֶ֗ה after כֹּל	גֵּ֝אֶ֗ה is an adjective
16	Prov 6:29	missing נֹּגֵ֥עַ after כֹּל	נֹּגֵ֥עַ belongs to another clause and is a verb
17	Prov 20:8	missing רָֽע after כֹּל	רָֽע is an adjective
18	2 Chr 13:9	missing בָּ֗א after כֹּל	בָּ֗א belongs to another clause and is a verb
19	2 Chr 22:1	missing רִאשֹׁנִים֙ after כֹּל	רִאשֹׁנִים֙ is an ordinal?

In [62]:

abb2book = {'Gen': 'Genesis', 
            'Num':'Numbers', 
            'Deut': 'Deuteronomy',
            'Judg': 'Judges',
            '1_Sam': '1_Samuel',
            '1_Kgs': '1_Kings',
            'Isa': 'Isaiah',
            'Ps': 'Psalms',
            'Job': 'Job',
            'Prov': 'Proverbs',
            '2_Chr': '2_Chronicles'}

my_decisions = {'not yet going out of clause boundaries': {0,1, 2 ,6, 8, 11, 12 , 13, 16, 18},
                'fixed': {3, 4, 5, 7, 9, 10, 14, 15, 17, 19},}

for decision, numbers in my_decisions.items():
    for num in numbers:
        ku_eval.loc[num]['Decision'] = decision
        

for i, ref in enumerate(ku_eval.Verse):

    book, ch_vs = ref.split() if not re.match('^\d', ref) else ref.replace(' ', '_', 1).split()
    book = abb2book[book.strip()]
    chapter, verse = ch_vs.split(':')
    
    # get TF ref
    tf_node = T.nodeFromSection((book, int(chapter), int(verse)))
    for phrase in L.d(tf_node, 'phrase'):
        if 'KL/' in [F.lex.v(w) for w in L.d(phrase, 'word')]:
            print(ref)
            print(f'v1 issue: {ku_eval.Issue[i]}')
            print(f'v2 update: {ku_eval.Decision[i]}')
            A.prettyTuple((phrase,)+tuple(phrase2heads[phrase]), seqNumber=f'Ku Eval {i}', condenseType='clause', withNodes=True)

Gen 39:4
v1 issue: missing יֵשׁ after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 0

Genesis 39:4

431967

clause 431967 WxQ0|Defc

phrase 664857 Conj CP

21481

conj and

phrase 664858 Objc NP

21482

subs whole st=c

clause 431967 WxQ0|ZQt0

phrase 664861 Pred VP

21485

נָתַ֥ן

verb give qal perf

phrase 664862 Cmpl PP

21486

prep in

21487

יָדֹֽו׃

subs hand st=a

Num 5:2
v1 issue: missing two כֹּל’s (the one before צָר֖וּעַ and the one before זָ֑ב)
v2 update: not yet going out of clause boundaries

Result Ku Eval 1

Numbers 5:2

441351

clause 441351 WYq0

phrase 693489 Conj CP

72735

conj and

phrase 693490 Pred VP

72736

ישַׁלְּחוּ֙

verb send piel impf

phrase 693491 Cmpl PP

72737

prep from

72738

art the

72739

מַּחֲנֶ֔ה

subs camp st=a

phrase 693492 Objc NP

72740

subs whole st=c

72741

צָר֖וּעַ

subs have skin-disease qal ptcp st=a

72742

conj and

72743

subs whole st=c

72744

זָ֑ב

subs flow qal ptca st=a

72745

conj and

72746

כֹ֖ל

subs whole st=c

Num 5:2
v1 issue: it’s able to find the last כֹ֖ל but not טָמֵא following it
v2 update: not yet going out of clause boundaries

Result Ku Eval 2

Numbers 5:2

441351

clause 441351 WYq0

phrase 693489 Conj CP

72735

conj and

phrase 693490 Pred VP

72736

ישַׁלְּחוּ֙

verb send piel impf

phrase 693491 Cmpl PP

72737

prep from

72738

art the

72739

מַּחֲנֶ֔ה

subs camp st=a

phrase 693492 Objc NP

72740

subs whole st=c

72741

צָר֖וּעַ

subs have skin-disease qal ptcp st=a

72742

conj and

72743

subs whole st=c

72744

זָ֑ב

subs flow qal ptca st=a

72745

conj and

72746

כֹ֖ל

subs whole st=c

Num 31:7
v1 issue: missing זָכָר after כֹּל
v2 update: fixed

Result Ku Eval 3

Numbers 31:7

444275

clause 444275 Way0

phrase 702270 Conj CP

88990

conj and

phrase 702271 Pred VP

88991

יַּֽהַרְג֖וּ

verb kill qal wayq

phrase 702272 Objc NP

88992

subs whole st=c

88993

זָכָֽר׃

subs male st=a

Num 31:17
v1 issue: missing זָכָ֖ר after כֹּל
v2 update: fixed

Result Ku Eval 4

444294

clause 444294 ZIm0

phrase 702337 Pred VP

89191

הִרְג֥וּ

verb kill qal impv

phrase 702338 Objc NP

89192

subs whole st=c

89193

subs male st=a

phrase 702338 Objc NP|PP

89194

prep in

89195

art the

89196

טָּ֑ף

subs <those unable to march> st=a

Num 31:17
v1 issue: missing זָכָ֖ר after כֹּל
v2 update: fixed

Result Ku Eval 4

444295

clause 444295 WxI0|Defc

phrase 702339 Conj CP

89197

conj and

phrase 702340 Objc NP

89198

subs whole st=c

89199

אִשָּׁ֗ה

subs woman st=a

clause 444295 WxI0|ZIm0

phrase 702344 Pred VP

89205

הֲרֹֽגוּ׃

verb kill qal impv

Num 31:17
v1 issue: missing כֹּל before אִשָּׁה
v2 update: fixed

Result Ku Eval 5

444294

clause 444294 ZIm0

phrase 702337 Pred VP

89191

הִרְג֥וּ

verb kill qal impv

phrase 702338 Objc NP

89192

subs whole st=c

89193

subs male st=a

phrase 702338 Objc NP|PP

89194

prep in

89195

art the

89196

טָּ֑ף

subs <those unable to march> st=a

Num 31:17
v1 issue: missing כֹּל before אִשָּׁה
v2 update: fixed

Result Ku Eval 5

444295

clause 444295 WxI0|Defc

phrase 702339 Conj CP

89197

conj and

phrase 702340 Objc NP

89198

subs whole st=c

89199

אִשָּׁ֗ה

subs woman st=a

clause 444295 WxI0|ZIm0

phrase 702344 Pred VP

89205

הֲרֹֽגוּ׃

verb kill qal impv

Deut 25:18
v1 issue: missing חשׁל after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 6

Deuteronomy 25:18

447578

clause 447578 Coor Way0

phrase 712409 Conj CP

107109

conj and

phrase 712410 Pred VP

107110

יְזַנֵּ֤ב

verb cut off piel wayq

phrase 712411 Cmpl PP

107111

בְּךָ֙

prep in

phrase 712412 Objc NP

107112

subs whole st=c

Judg 13:4
v1 issue: missing טָמֵֽא after כֹּל
v2 update: fixed

Result Ku Eval 7

Judges 13:4

452736

clause 452736 WxY0

phrase 727954 Conj CP

135761

conj and

phrase 727955 Nega NegP

135762

אַל־

nega not

phrase 727956 Pred VP

135763

תֹּאכְלִ֖י

verb eat qal impf

phrase 727957 Objc NP

135764

subs whole st=c

135765

טָמֵֽא׃

subs unclean st=a

1 Sam 14:36
v1 issue: missing טוֹב after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 8

1_Samuel 14:36

455548

clause 455548 xIm0|Defc

phrase 736484 Objc NP

149171

subs whole st=c

clause 455548 xIm0|ZIm0

phrase 736488 Pred VP

149176

עֲשֵׂ֑ה ס

verb make qal impv

1 Kgs 11:15
v1 issue: missing זָכָר after כֹּל
v2 update: fixed

Result Ku Eval 9

1_Kings 11:15

462722

clause 462722 Coor Way0

phrase 758181 Conj CP

185299

conj and

phrase 758182 Pred VP

185300

יַּ֥ךְ

verb strike hif wayq

phrase 758183 Objc NP

185301

subs whole st=c

185302

subs male st=a

phrase 758183 Objc NP|PP

185303

בֶּ

prep in

185304

אֱדֹֽום׃

nmpr Edom st=a

1 Kgs 11:16
v1 issue: missing זָכָר after כֹּל
v2 update: fixed

Result Ku Eval 10

1_Kings 11:16

462723

clause 462723 xQtX

phrase 758184 Conj CP

185305

כִּ֣י

conj that

phrase 758185 Time NP

185306

שֵׁ֧שֶׁת

subs six st=c

185307

חֳדָשִׁ֛ים

subs month st=a

phrase 758186 Pred VP

185308

יָֽשַׁב־

verb sit qal perf

phrase 758187 Cmpl AdvP

185309

שָׁ֥ם

advb there

phrase 758188 Subj PrNP

185310

יֹואָ֖ב

nmpr Joab st=a

185311

conj and

185312

subs whole st=c

185313

nmpr Israel st=a

1 Kgs 11:16
v1 issue: missing זָכָר after כֹּל
v2 update: fixed

Result Ku Eval 10

1_Kings 11:16

462724

clause 462724 xQt0

phrase 758189 Conj CP

185314

conj unto

phrase 758190 Pred VP

185315

הִכְרִ֥ית

verb cut hif perf

phrase 758191 Objc NP

185316

subs whole st=c

185317

subs male st=a

phrase 758191 Objc NP|PP

185318

בֶּ

prep in

185319

אֱדֹֽום׃

nmpr Edom st=a

Isa 43:7
v1 issue: missing הַנִּקְרָ֣א after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 11

Isaiah 43:7

471639

clause 471639 Ellp

phrase 783566 Objc NP

227043

כֹּ֚ל

subs whole st=c

Ps 73:27
v1 issue: missing זֹונֶ֥ה after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 12

Psalms 73:27

493806

clause 493806 Coor ZQt0

phrase 843943 Pred VP

322765

הִ֝צְמַ֗תָּה

verb be silent hif perf

phrase 843944 Objc NP

322766

subs whole st=c

Ps 119:118
v1 issue: missing שֹׁוגִ֣ים after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 13

Psalms 119:118

496425

clause 496425 ZQt0

phrase 850462 Pred VP

332217

סָ֭לִיתָ

verb reject qal perf

phrase 850463 Objc NP

332218

subs whole st=c

Job 40:11
v1 issue: missing גֵּ֝אֶ֗ה after כֹּל
v2 update: fixed

Result Ku Eval 14

Job 40:11

500267

clause 500267 WIm0

phrase 860845 Conj CP

346212

conj and

phrase 860846 Pred VP

346213

רְאֵ֥ה

verb see qal impv

phrase 860847 Objc NP

346214

subs whole st=c

346215

גֵּ֝אֶ֗ה

subs haughty st=a

Job 40:12
v1 issue: missing גֵּ֝אֶ֗ה after כֹּל
v2 update: fixed

Result Ku Eval 15

Job 40:12

500269

clause 500269 ZIm0

phrase 860850 Pred VP

346218

רְאֵ֣ה

verb see qal impv

phrase 860851 Objc NP

346219

subs whole st=c

346220

גֵּ֭אֶה

subs haughty st=a

Prov 6:29
v1 issue: missing נֹּגֵ֥עַ after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 16

Proverbs 6:29

500863

clause 500863 xYqX

phrase 862467 Nega NegP

348522

לֹ֥א

nega not

phrase 862468 Pred VP

348523

יִ֝נָּקֶ֗ה

verb be clean nif impf

phrase 862469 Subj NP

348524

כָּֽל־

subs whole st=c

Prov 20:8
v1 issue: missing רָֽע after כֹּל
v2 update: fixed

Result Ku Eval 17

Proverbs 20:8

501856

clause 501856 Ptcp|Defc

phrase 865036 Subj NP

352194

מֶ֗לֶךְ

subs king st=a

clause 501856 Ptcp

phrase 865039 PreC VP

352199

מְזָרֶ֖ה

verb scatter piel ptca st=a

phrase 865040 Adju PP

352200

prep in

352201

עֵינָ֣יו

subs eye st=a

phrase 865041 Objc NP

352202

subs whole st=c

352203

רָֽע׃

subs evil st=a

2 Chr 13:9
v1 issue: missing בָּ֗א after כֹּל
v2 update: not yet going out of clause boundaries

Result Ku Eval 18

2_Chronicles 13:9

513398

clause 513398 Ellp

phrase 897827 Objc NP

413167

subs whole st=c

2 Chr 22:1
v1 issue: missing רִאשֹׁנִים֙ after כֹּל
v2 update: fixed

Result Ku Eval 19

2_Chronicles 22:1

514152

clause 514152 xQtX

phrase 900085 Conj CP

417283

כִּ֤י

conj that

phrase 900086 Objc NP

417284

subs whole st=c

417285

art the

417286

רִאשֹׁנִים֙

subs first st=a

phrase 900087 Pred VP

417287

הָרַ֣ג

verb kill qal perf

phrase 900088 Subj NP

417288

art the

417289

גְּד֔וּד

subs band st=a

By way of overview, here are the previous issues and the new decisions.

In [63]:

ku_eval

Out[63]:

	Verse	Issue	Cause	Decision
0	Gen 39:4	missing יֵשׁ after כֹּל	יֵשׁ is not in the same clause as כֹּל	not yet going out of clause boundaries
1	Num 5:2	missing two כֹּל’s (the one before צָר֖וּעַ an...		not yet going out of clause boundaries
2	Num 5:2	it’s able to find the last כֹ֖ל but not טָמֵא ...	טָמֵא belongs to another clause and is not mar...	not yet going out of clause boundaries
3	Num 31:7	missing זָכָר after כֹּל	זָכָר is an adjective	fixed
4	Num 31:17	missing זָכָ֖ר after כֹּל	זָכָר is an adjective	fixed
5	Num 31:17	missing כֹּל before אִשָּׁה		fixed
6	Deut 25:18	missing חשׁל after כֹּל	נֶּחֱשָׁלִ֣ים belongs to another clause and is...	not yet going out of clause boundaries
7	Judg 13:4	missing טָמֵֽא after כֹּל	טָמֵֽא is an adjective	fixed
8	1 Sam 14:36	missing טוֹב after כֹּל	טוֹב is an adjective	not yet going out of clause boundaries
9	1 Kgs 11:15	missing זָכָר after כֹּל	זָכָר is an adjective	fixed
10	1 Kgs 11:16	missing זָכָר after כֹּל	זָכָר is an adjective	fixed
11	Isa 43:7	missing הַנִּקְרָ֣א after כֹּל	נִּקְרָ֣א belongs to another clause and is a verb	not yet going out of clause boundaries
12	Ps 73:27	missing זֹונֶ֥ה after כֹּל	זֹונֶ֥ה belongs to another clause and is a verb	not yet going out of clause boundaries
13	Ps 119:118	missing שֹׁוגִ֣ים after כֹּל	שֹׁוגִ֣ים belongs to another clause and is a verb	not yet going out of clause boundaries
14	Job 40:11	missing גֵּ֝אֶ֗ה after כֹּל	גֵּ֝אֶ֗ה is an adjective	fixed
15	Job 40:12	missing גֵּ֝אֶ֗ה after כֹּל	גֵּ֝אֶ֗ה is an adjective	fixed
16	Prov 6:29	missing נֹּגֵ֥עַ after כֹּל	נֹּגֵ֥עַ belongs to another clause and is a verb	not yet going out of clause boundaries
17	Prov 20:8	missing רָֽע after כֹּל	רָֽע is an adjective	fixed
18	2 Chr 13:9	missing בָּ֗א after כֹּל	בָּ֗א belongs to another clause and is a verb	not yet going out of clause boundaries
19	2 Chr 22:1	missing רִאשֹׁנִים֙ after כֹּל	רִאשֹׁנִים֙ is an ordinal?	fixed

As indicated in the decisions, I have decided to not yet extend beyond the clause boundaries for situations where כל quantifies an entire clause. That step may require further methodological evaluation about the role of clause and phrase embeddings. This is a task that is better suited within a whole new data model, such as a second BHSA can provide.

All other cases have been fixed by the new heads selection procedures. This already shows that the new heads performs better than the previous version.

Statistical Evaluation¶

Below I provide statistical counts and visualizations for the head assignments.

In [64]:

typ2pdpcounts = collections.defaultdict(lambda: collections.Counter())

for phrase, heads in phrase2heads.items():
    typ = F.typ.v(phrase)
    pdps = [F.pdp.v(head) for head in heads]
    typ2pdpcounts[typ].update(pdps)
    
typ2pdpcounts = pd.DataFrame(typ2pdpcounts).fillna(0)

In [65]:

sns.set(style='whitegrid', font_scale=1.4)

for typ in typ2pdpcounts:
    positive = typ2pdpcounts[typ][typ2pdpcounts[typ] > 0].sort_values(ascending=False)
    
    print(f'Parts of Speech for {typ}')
    display(pd.DataFrame(positive))
    
    plt.figure(figsize=(5, 3))
    sns.barplot(x=positive.index, y=positive)
    plt.ylabel('frequency')
    plt.show()

Parts of Speech for PPrP

	PPrP
prps	4386.0
subs	262.0
nmpr	21.0

Parts of Speech for DPrP

	DPrP
prde	790.0

Parts of Speech for InjP

	InjP
intj	1883.0

Parts of Speech for NegP

	NegP
nega	6742.0

Parts of Speech for InrP

	InrP
inrg	1291.0

Parts of Speech for IPrP

	IPrP
prin	798.0

Parts of Speech for VP

	VP
verb	69024.0

Parts of Speech for CP

	CP
conj	51341.0
prep	1140.0

Parts of Speech for `AdjP`

	AdjP
adjv	1866.0
subs	4.0
advb	1.0

Parts of Speech for AdvP

	AdvP
advb	5172.0
subs	368.0
nmpr	234.0
prep	2.0

Parts of Speech for PP

	PP
prep	61498.0

Parts of Speech for NP

	NP
subs	44729.0
nmpr	149.0
adjv	117.0
prde	49.0
prps	4.0
intj	1.0

Parts of Speech for PrNP

	PrNP
nmpr	11633.0
subs	331.0
prps	1.0

Evaluating Quantifiers¶

By far, quantifiers are the most tricky of issues involved in picking out heads. Let's evaluate how many and which quantifiers have been selected.

In all cases, a quantifier should only be selected if it is not followed by a quantified noun. These quantifiers are selected with the NP_quant_alone query above. A quantifier should not have been selected in any other case.

In [66]:

len([head for phrase, heads in phrase2heads.items()
            for head in heads if head in quantifiers
            if F.typ.v(phrase) in {'NP', 'PrNP'}]) <= 1731 # less or equal quantifier heads than what NP_quant_alone found

Out[66]:

False

This is a good result. Let's check whether there are quantifiers in other phrase types.

In [67]:

A.show([(phrase, head) for phrase, heads in phrase2heads.items()
            for head in heads if head in quantifiers
            if F.typ.v(phrase) not in {'NP', 'PrNP'}], condenseType='phrase', withNodes=True)

phrase 1

Genesis 20:7

656961

phrase 656961 Subj PPrP

9394

אַתָּ֖ה

prps you

9395

conj and

9396

subs whole st=c

phrase 2

Genesis 31:21

661820

phrase 661820 Subj PPrP

16745

הוּא֙

prps he

16746

conj and

16747

subs whole st=c

phrase 3

Genesis 45:10

667639

phrase 667639 Subj PPrP

25651

אַתָּ֕ה

prps you

25652

conj and

25653

בָנֶ֖יךָ

subs son st=a

25654

conj and

25655

בְנֵ֣י

subs son st=c

25656

בָנֶ֑יךָ

subs son st=a

25657

conj and

25658

צֹאנְךָ֥

subs cattle st=a

25659

conj and

25660

בְקָרְךָ֖

subs cattle st=a

25661

conj and

25662

subs whole st=c

phrase 4

Genesis 45:11

667651

phrase 667651 Subj PPrP

25676

אַתָּ֥ה

prps you

25677

conj and

25678

בֵֽיתְךָ֖

subs house st=a

25679

conj and

25680

subs whole st=c

phrase 5

Exodus 24:1

677392

phrase 677392 Subj PPrP

41608

אַתָּה֙

prps you

41609

conj and

41610

אַהֲרֹן֙

nmpr Aaron st=a

41611

נָדָ֣ב

nmpr Nadab st=a

41612

conj and

41613

אֲבִיה֔וּא

nmpr Abihu st=a

41614

conj and

41615

שִׁבְעִ֖ים

subs seven st=a

phrase 677392 Subj PPrP|PP

41616

prep from

41617

זִּקְנֵ֣י

subs old st=c

41618

nmpr Israel st=a

phrase 6

Numbers 16:33

697762

phrase 697762 Subj PPrP

80695

הֵ֣ם

prps they

80696

conj and

80697

subs whole st=c

phrase 7

Judges 7:18

725606

phrase 725606 Subj PPrP

131896

אָנֹכִ֖י

prps i

131897

conj and

131898

subs whole st=c

phrase 8

1_Kings 20:4

762275

phrase 762275 Subj PPrP

191995

אֲנִ֖י

prps i

191996

conj and

191997

subs whole st=c

phrase 9

Genesis 4:15

652681

phrase 652681 Modi AdvP

1915

שִׁבְעָתַ֖יִם

advb seven st=a

phrase 10

Genesis 4:24

652779

phrase 652779 Modi AdvP

2068

שִׁבְעָתַ֖יִם

advb seven st=a

phrase 11

Exodus 23:30

677350

phrase 677350 Modi AdvP

41535

מְעַ֥ט

advb little st=a

41536

מְעַ֛ט

advb little st=a

phrase 12

Exodus 23:30

677350

phrase 677350 Modi AdvP

41535

מְעַ֥ט

advb little st=a

41536

מְעַ֛ט

advb little st=a

phrase 13

Deuteronomy 7:22

706975

phrase 706975 Modi AdvP

97910

מְעַ֣ט

advb little st=a

97911

מְעָ֑ט

advb little st=a

phrase 14

Deuteronomy 7:22

706975

phrase 706975 Modi AdvP

97910

מְעַ֣ט

advb little st=a

97911

מְעָ֑ט

advb little st=a

phrase 15

2_Samuel 12:6

747419

phrase 747419 Modi AdvP

166495

אַרְבַּעְתָּ֑יִם

advb four st=a

phrase 16

2_Samuel 16:1

749586

phrase 749586 Modi AdvP

169746

מְעַט֙

advb little st=a

phrase 17

2_Kings 10:18

768569

phrase 768569 Modi AdvP

201575

מְעָ֑ט

advb little st=a

phrase 18

Isaiah 30:26

780393

phrase 780393 Modi AdvP

222298

שִׁבְעָתַ֔יִם

advb seven st=a

phrase 19

Jeremiah 42:2

802718

phrase 802718 Modi AdvP

257841

מְעַט֙

advb little st=a

phrase 20

Jeremiah 51:33

806024

phrase 806024 Modi AdvP

263328

מְעַ֔ט

advb little st=a

phrase 21

Ezekiel 11:16

809512

phrase 809512 Modi AdvP

269147

מְעַ֔ט

advb little st=a

phrase 22

Psalms 8:6

835973

phrase 835973 Modi AdvP

311515

מְּ֭עַט

advb little st=a

phrase 23

Psalms 12:7

836397

phrase 836397 Modi AdvP

312132

שִׁבְעָתָֽיִם׃

advb seven st=a

phrase 24

Psalms 79:12

845066

phrase 845066 Modi AdvP

324385

שִׁ֭בְעָתַיִם

advb seven st=a

phrase 25

Job 10:20

855120

phrase 855120 Modi AdvP

338787

מְּעָֽט׃

advb little st=a

phrase 26

Job 24:24

857792

phrase 857792 Modi AdvP

342165

מְּעַ֨ט׀

advb little st=a

phrase 27

Proverbs 6:31

862485

phrase 862485 Modi AdvP

348543

שִׁבְעָתָ֑יִם

advb seven st=a

phrase 28

Ruth 2:7

867938

phrase 867938 Modi AdvP

356376

מְעָֽט׃

advb little st=a

phrase 29

Ecclesiastes 5:11

871010

phrase 871010 Modi AdvP

361060

מְעַ֥ט

advb little st=a

These quantifiers are all well-chosen.

Next, I want to check whether there are any cases of a cardinal number and a substantive occurring together as head elements. These combinations often identify mismatched quantifiers.

In [68]:

card_mix = [(phrase,)+tuple(heads) for phrase, heads in phrase2heads.items()
                 if [w for w in heads if F.ls.v(w) == 'card']
                 and [w for w in heads if F.ls.v(w) != 'card']]

len(card_mix)

Out[68]:

In [69]:

A.show(card_mix, condenseType='phrase', withNodes=True, end=100)

phrase 1

Exodus 24:1

677392

phrase 677392 Subj PPrP

41608

אַתָּה֙

prps you

41609

conj and

41610

אַהֲרֹן֙

nmpr Aaron st=a

41611

נָדָ֣ב

nmpr Nadab st=a

41612

conj and

41613

אֲבִיה֔וּא

nmpr Abihu st=a

41614

conj and

41615

שִׁבְעִ֖ים

subs seven st=a

phrase 677392 Subj PPrP|PP

41616

prep from

41617

זִּקְנֵ֣י

subs old st=c

41618

nmpr Israel st=a

phrase 2

Exodus 24:9

677497

phrase 677497 Subj PrNP

41784

מֹשֶׁ֖ה

nmpr Moses st=a

41785

conj and

41786

אַהֲרֹ֑ן

nmpr Aaron st=a

41787

נָדָב֙

nmpr Nadab st=a

41788

conj and

41789

אֲבִיה֔וּא

nmpr Abihu st=a

41790

conj and

41791

שִׁבְעִ֖ים

subs seven st=a

phrase 677497 Subj PrNP|PP

41792

prep from

41793

זִּקְנֵ֥י

subs old st=c

41794

יִשְׂרָאֵֽל׃

nmpr Israel st=a

phrase 3

Joshua 15:32

720376

phrase 720376 PreC NP

121906

subs twenty st=a

121907

conj and

121908

תֵ֖שַׁע

subs nine st=a

121909

conj and

121910

חַצְרֵיהֶֽן׃ ס

subs court st=a

phrase 4

2_Kings 1:9

764206

phrase 764206 Objc NP

194958

subs chief st=c

194959

חֲמִשִּׁ֖ים

subs five st=a

194960

conj and

194961

subs five st=a

phrase 5

2_Kings 1:11

764249

phrase 764249 Objc NP

195024

subs chief st=c

195025

חֲמִשִּׁ֥ים

subs five st=a

195026

אַחֵ֖ר

adjv other st=a

195027

conj and

195028

subs five st=a

phrase 6

2_Kings 1:13

764287

phrase 764287 Objc NP

195082

subs chief st=c

195083

חֲמִשִּׁ֥ים

subs five st=a

195084

שְׁלִשִׁ֖ים

adjv third st=a

195085

conj and

195086

subs five st=a

phrase 7

Nehemiah 13:20

887266

phrase 887266 Modi NP

391009

פַּ֥עַם

subs foot st=a

391010

conj and

391011

שְׁתָּֽיִם׃

subs two st=a

It is good that there are only 7 such cases. All of these cases appear to be permissible combinations of quantifiers and nominals, where the quantifier stands as its own element. In the first case, it is perhaps conceivable that the quantifier should be omitted, and זכן be selected instead. However, such a situation is a semantic choice that cannot be easily automated. There are many additional concerns that this method would need to address.

Next, I want to know whether any cardinal quantifiers have unfairly been excluded from head roles by the strict requirements for standalone quantifiers. This would occur in phrases that only contain standalone quantifiers. This will require a search of its own, followed by a comparison of the search's heads with the selected heads.

In [70]:

all_cards = A.search(f'''

phrase typ=NP|PrNP

% all non-conj./art. words in phrase are cardinals
/where/
    word pdp#conj|art
/have/
    ls=card
/-/

    phrase_atom rela=NA|Para
        quantifier:quant

% require either NA subphrase relation
% or no subphrase embedding:
        /with/
        subphrase rela=NA|par
            quantifier
        /or/
        /without/
        subphrase
            quantifier
        /-/
        /-/

% exclude uses as modifier:
        /without/
        subphrase rela=adj|atr|rec|mod|dem
            quantifier
        /-/
        
% ensure word is not immediately preceded by a construct form
        /without/
        phrase_atom
            word st=c
            <: quantifier
        /-/

% ensure quantifier is not immediately preceded by a preposition
        /without/
        phrase_atom
            word pdp=prep
            <: quantifier
        /-/
        /without/
        phrase_atom
            word pdp=prep
            <: word pdp=art
            <: quantifier
        /-/
''', sets={'quant': quantifiers})

all_cards_heads = set(res[2] for res in all_cards)
select_cards_heads = set(head for phrase, heads in phrase2heads.items() for head in heads if F.ls.v(head) == 'card')

print(f'\n{len(all_cards_heads - select_cards_heads)} missing standalone quantifiers...')

  1.14s 1031 results

0 missing standalone quantifiers...

This is a good result. The search found 1031 phrase heads in cardinal-only phrases. All of these heads are contained in the heads2 dataset.

Note that the NP_quant_alone pattern found more. This is expected since NP_quant_alone is a more sophisticated search that checks between phrase atoms for various relations. We have already tested above to make sure those selections are valid. But just to be sure, which results did the all_cards pattern above not find? Below is a small sampling. They often have to do with additional modifiers on the cardinal number contained in a separate phrase_atom.

In [71]:

test = [(L.u(head, 'phrase')[0],)+tuple(phrase2heads[L.u(head, 'phrase')[0]]) for head in select_cards_heads - all_cards_heads]

A.show(test, condenseType='phrase', withNodes=True, end=5)

phrase 1

1_Kings 10:22

757950

phrase 757950 Time NP

184839

אַחַת֩

subs one st=a

phrase 757950 Time NP|PP

184840

prep to

184841

שָׁלֹ֨שׁ

subs three st=a

184842

שָׁנִ֜ים

subs year st=a

phrase 2

Ezra 8:24

883029

phrase 883029 Objc NP

381959

שְׁנֵ֣ים

subs two st=a

381960

עָשָׂ֑ר

subs -teen st=a

phrase 883029 Objc NP|PP

381961

prep to

381962

שֵׁרֵֽבְיָ֣ה

nmpr Sherebiah st=a

381963

חֲשַׁבְיָ֔ה

nmpr Hashabiah st=a

phrase 3

Genesis 7:2

653340

phrase 653340 Objc NP

3083

שְׁנַ֖יִם

subs two st=a

phrase 653340 Objc NP

3084

אִ֥ישׁ

subs man st=a

3085

conj and

3086

אִשְׁתֹּֽו׃

subs woman st=a

phrase 4

2_Kings 1:13

764287

phrase 764287 Objc NP

195082

subs chief st=c

195083

חֲמִשִּׁ֥ים

subs five st=a

195084

שְׁלִשִׁ֖ים

adjv third st=a

195085

conj and

195086

subs five st=a

phrase 5

Genesis 4:24

652779

phrase 652779 Modi AdvP

2068

שִׁבְעָתַ֖יִם

advb seven st=a

Inspecting Missing Relation Cases¶

At the beginning of this notebook, I made numerous exclusions of heads using the dword (dependent word) and iword (independent word) sets. This is because the BHSA omits multiple relations between words that should be there. We have excluded those words. Let's evaluate how well those phrases performed in the head selection process:

In [72]:

for word in L.d(792539, 'word'):
    print(word, T.text(word), word in dwords)

239841 חֶ֛סֶד  False
239842 מִשְׁפָּ֥ט  False
239843 וּ False
239844 צְדָקָ֖ה  False

In [73]:

show_subphrases(792539)

-------1363416----------------

חֶ֛סֶד מִשְׁפָּ֥ט  -NA-> 
nodes:  1363416 -NA-> 
slots:  [239841, 239842] -NA-> ()
------------------------------
-------1363417----------------

צְדָקָ֖ה  -par-> חֶ֛סֶד מִשְׁפָּ֥ט 
nodes:  1363417 -par-> 1363416
slots:  [239844] -par-> [239841, 239842]
------------------------------

In [74]:

NP_missing_atr_rec = [(res[0],) + tuple(phrase2heads[res[0]]) for res in missing_atr_rec 
                          if F.typ.v(res[0]) in {'NP', 'PrNP'}]

A.show(NP_missing_atr_rec)

phrase 1

2_Samuel 24:9

753091

phrase 753091 PreC NP

175628

שְׁמֹנֶה֩

subs eight st=a

175629

מֵאֹ֨ות

subs hundred st=a

175630

אֶ֤לֶף

subs thousand st=a

175631

אִֽישׁ־

subs man st=a

175632

חַ֨יִל֙

subs power st=a

phrase 2

1_Kings 5:3

755341

phrase 755341 PreC NP

179384

עֲשָׂרָ֨ה

subs ten st=a

179385

בָקָ֜ר

subs cattle st=a

179386

בְּרִאִ֗ים

adjv fat st=a

179387

conj and

179388

subs twenty st=a

179389

בָּקָ֛ר

subs cattle st=a

179390

רְעִ֖י

subs pasture st=a

179391

conj and

179392

מֵ֣אָה

subs hundred st=a

179393

צֹ֑אן

subs cattle st=a

phrase 3

Proverbs 21:23

865306

phrase 865306 Subj NP

352627

שֹׁמֵ֣ר

subs keep qal ptca st=c

352628

פִּ֭יו

subs mouth st=a

352629

conj and

352630

לְשֹׁונֹ֑ו

subs tongue st=a

phrase 4

Leviticus 13:59

686898

phrase 686898 PreC NP

60185

תֹּורַ֨ת

subs instruction st=c

60186

נֶֽגַע־

subs stroke st=c

60187

צָרַ֜עַת

subs skin-disease st=c

60188

בֶּ֥גֶד

subs garment st=c

60189

art the

60190

צֶּ֣מֶר׀

subs wool st=a

60191

conj or

60192

art the

60193

פִּשְׁתִּ֗ים

subs flax st=a

60194

אֹ֤ו

conj or

60195

art the

60196

שְּׁתִי֙

subs texture st=a

60197

conj or

60198

art the

60199

עֵ֔רֶב

subs woof st=a

60200

אֹ֖ו

conj or

60201

subs whole st=c

60202

subs tool st=c

60203

עֹ֑ור

subs skin st=a

phrase 5

702367

phrase 702367 Objc NP

subs whole st=c

subs garment st=a

conj and

subs whole st=c

subs tool st=c

subs skin st=a

conj and

subs whole st=c

subs deed st=c

subs goat st=a

conj and

subs whole st=c

subs tool st=c

subs tree st=a

phrase 6

2_Chronicles 30:12

902589

phrase 902589 Objc NP

422131

מִצְוַ֥ת

subs commandment st=c

422132

art the

422133

מֶּ֛לֶךְ

subs king st=a

422134

conj and

422135

art the

422136

שָּׂרִ֖ים

subs chief st=a

phrase 7

2_Kings 12:13

769247

phrase 769247 Objc NP

202861

עֵצִים֙

subs tree st=a

202862

conj and

202863

אַבְנֵ֣י

subs stone st=c

202864

מַחְצֵ֔ב

subs hewn stone st=a

phrase 8

702367

phrase 702367 Objc NP

subs whole st=c

subs garment st=a

conj and

subs whole st=c

subs tool st=c

subs skin st=a

conj and

subs whole st=c

subs deed st=c

subs goat st=a

conj and

subs whole st=c

subs tool st=c

subs tree st=a

phrase 9

Isaiah 21:17

778315

phrase 778315 Subj NP

218969

שְׁאָ֧ר

subs rest st=c

218970

מִסְפַּר־

subs number st=c

218971

קֶ֛שֶׁת

subs bow st=c

218972

גִּבֹּורֵ֥י

subs vigorous st=c

218973

בְנֵֽי־

subs son st=c

218974

קֵדָ֖ר

nmpr Kedar st=a

phrase 10

2_Kings 6:17

766610

phrase 766610 Objc NP

198502

סוּסִ֥ים

subs horse st=a

198503

conj and

198504

רֶ֛כֶב

subs chariot st=c

198505

אֵ֖שׁ

subs fire st=a

phrase 11

Isaiah 13:4

776817

phrase 776817 Subj NP

216535

קֹ֠ול

subs sound st=c

216536

שְׁאֹ֞ון

subs roar st=c

216537

מַמְלְכֹ֤ות

subs kingdom st=c

216538

גֹּויִם֙

subs people st=a

216539

נֶֽאֱסָפִ֔ים

adjv gather nif ptca st=a

phrase 12

Isaiah 51:3

785757

phrase 785757 Subj NP

229892

שָׂשֹׂ֤ון

subs rejoicing st=a

229893

conj and

229894

שִׂמְחָה֙

subs joy st=a

phrase 785757 Subj NP

229897

תֹּודָ֖ה

subs thanksgiving st=a

229898

conj and

229899

קֹ֥ול

subs sound st=c

229900

זִמְרָֽה׃ ס

subs melody st=a

phrase 13

2_Chronicles 17:12

898663

phrase 898663 Objc NP

414809

בִּירָנִיֹּ֖ות

subs fortified place st=a

414810

conj and

414811

עָרֵ֥י

subs town st=c

414812

מִסְכְּנֹֽות׃

subs storages st=a

phrase 14

Proverbs 20:20

865119

phrase 865119 Frnt NP

352309

מְ֭קַלֵּל

subs be slight piel ptca st=c

352310

אָבִ֣יו

subs father st=a

352311

conj and

352312

אִמֹּ֑ו

subs mother st=a

phrase 15

Numbers 3:25

692962

phrase 692962 PreC NP

71278

art the

71279

מִּשְׁכָּ֖ן

subs dwelling-place st=a

71280

conj and

71281

art the

71282

אֹ֑הֶל

subs tent st=a

phrase 692962 PreC NP

71283

מִכְסֵ֕הוּ

subs covering st=a

71284

conj and

71285

מָסַ֕ךְ

subs covering st=c

71286

פֶּ֖תַח

subs opening st=c

71287

אֹ֥הֶל

subs tent st=c

71288

מֹועֵֽד׃

subs appointment st=a

phrase 16

Isaiah 20:6

778151

phrase 778151 Subj NP

218701

יֹשֵׁ֨ב

subs sit qal ptca st=a

218702

art the

218703

אִ֣י

subs coast, island st=a

218704

art the

218705

זֶּה֮

prde this

phrase 17

Deuteronomy 28:52

713495

phrase 713495 Subj NP

109069

חֹמֹתֶ֨יךָ֙

subs wall st=a

109070

art the

109071

גְּבֹהֹ֣ות

adjv high st=a

109072

conj and

109073

art the

109074

בְּצֻרֹ֔ות

adjv fortified st=a

phrase 18

Joshua 13:11

719764

phrase 719764 Loca PrNP

120578

art the

120579

גִּלְעָ֞ד

nmpr Gilead st=a

phrase 719764 Loca PrNP|CP

120580

conj and

phrase 719764 Loca PrNP|NP

120581

גְב֧וּל

subs boundary st=c

120582

art the

120583

גְּשׁוּרִ֣י

subs Geshurite st=a

120584

conj and

120585

art the

120586

מַּעֲכָתִ֗י

subs Maacathite st=a

phrase 719764 Loca PrNP|CP

120587

conj and

phrase 719764 Loca PrNP|NP

120588

כֹ֨ל

subs whole st=c

120589

הַ֥ר

subs mountain st=c

120590

חֶרְמֹ֛ון

nmpr Hermon st=a

120591

conj and

120592

subs whole st=c

120593

art the

120594

בָּשָׁ֖ן

nmpr Bashan st=a

phrase 719764 Loca PrNP|PP

120595

prep unto

120596

סַלְכָֽה׃

nmpr Salecah st=a

phrase 19

2_Kings 22:6

772835

phrase 772835 Objc NP

209361

עֵצִים֙

subs tree st=a

209362

conj and

209363

אַבְנֵ֣י

subs stone st=c

209364

מַחְצֵ֔ב

subs hewn stone st=a

phrase 20

2_Kings 7:10

767101

phrase 767101 Subj NP

199226

אִ֖ישׁ

subs man st=a

199227

conj and

199228

קֹ֣ול

subs sound st=c

199229

אָדָ֑ם

subs human, mankind st=a

phrase 21

1_Chronicles 9:13

889426

phrase 889426 PreC NP

396194

גִּבֹּ֣ורֵי

subs vigorous st=c

396195

חֵ֔יל

subs power st=c

396196

מְלֶ֖אכֶת

subs work st=c

396197

עֲבֹודַ֥ת

subs work st=c

396198

subs house st=c

396199

art the

396200

אֱלֹהִֽים׃

subs god(s) st=a

phrase 22

Genesis 15:10

655306

phrase 655306 Objc NP

6839

אִישׁ־

subs man st=a

6840

בִּתְרֹ֖ו

subs piece st=a

phrase 23

1_Samuel 25:18

741511

phrase 741511 Objc NP

156885

מָאתַ֨יִם

subs hundred st=a

156886

לֶ֜חֶם

subs bread st=a

156887

conj and

156888

שְׁנַ֣יִם

subs two st=a

156889

נִבְלֵי־

subs jar st=c

156890

יַ֗יִן

subs wine st=a

156891

conj and

156892

חָמֵ֨שׁ

subs five st=a

156893

צֹ֤אן

subs cattle st=a

156894

עֲשׂוּיֹות֙

adjv make qal ptcp st=a

156895

conj and

156896

חָמֵ֤שׁ

subs five st=a

156897

סְאִים֙

subs seah st=a

156898

קָלִ֔י

subs parched grain st=a

156899

conj and

156900

מֵאָ֥ה

subs hundred st=a

156901

צִמֻּקִ֖ים

subs cakes st=a

156902

conj and

156903

מָאתַ֣יִם

subs hundred st=a

156904

דְּבֵלִ֑ים

subs fig cake st=a

phrase 24

Proverbs 21:17

865280

phrase 865280 Subj NP

352574

אֹהֵ֥ב

subs love qal ptca st=c

352575

יַֽיִן־

subs wine st=a

352576

וָ֝

conj and

352577

שֶׁ֗מֶן

subs oil st=a

phrase 25

1_Kings 8:56

757263

phrase 757263 Subj NP

183384

דָּבָ֣ר

subs word st=a

183385

אֶחָ֗ד

subs one st=a

phrase 757263 Subj NP|PP

183386

prep from

183387

כֹּל֙

subs whole st=c

183388

דְּבָרֹ֣ו

subs word st=a

183389

art the

183390

טֹּ֔וב

adjv good st=a

phrase 26

Nehemiah 8:15

885776

phrase 885776 Objc NP

387601

עֲלֵי־

subs leafage st=c

387602

זַ֨יִת֙

subs olive st=a

387603

conj and

387604

עֲלֵי־

subs leafage st=c

387605

עֵ֣ץ

subs tree st=c

387606

שֶׁ֔מֶן

subs oil st=a

387607

conj and

387608

עֲלֵ֤י

subs leafage st=c

387609

הֲדַס֙

subs myrtle st=a

387610

conj and

387611

עֲלֵ֣י

subs leafage st=c

387612

תְמָרִ֔ים

subs date-palm st=a

387613

conj and

387614

עֲלֵ֖י

subs leafage st=c

387615

עֵ֣ץ

subs tree st=c

387616

עָבֹ֑ת

subs branchy st=a

phrase 27

Jeremiah 9:23

792539

phrase 792539 Objc NP

239841

חֶ֛סֶד

subs loyalty st=a

239842

מִשְׁפָּ֥ט

subs justice st=a

239843

conj and

239844

צְדָקָ֖ה

subs justice st=a

phrase 28

Daniel 8:26

879818

phrase 879818 Frnt NP

375719

מַרְאֵ֨ה

subs sight st=c

375720

art the

375721

עֶ֧רֶב

subs evening st=a

375722

conj and

375723

art the

375724

בֹּ֛קֶר

subs morning st=a

phrase 29

Ezra 4:13

881928

phrase 881928 Objc NP

379774

מִנְדָּֽה־

subs tax st=a

379775

בְלֹ֤ו

subs tribute st=a

379776

conj and

379777

הֲלָךְ֙

subs tax st=a

phrase 30

Psalms 145:5

852591

phrase 852591 Objc NP

335282

הֲ֭דַר

subs ornament st=c

335283

כְּבֹ֣וד

subs weight st=c

335284

הֹודֶ֑ךָ

subs splendour st=a

335285

conj and

335286

דִבְרֵ֖י

subs word st=c

335287

נִפְלְאֹותֶ֣יךָ

subs be miraculous nif ptca st=a

phrase 31

1_Chronicles 9:13

889426

phrase 889426 PreC NP

396194

גִּבֹּ֣ורֵי

subs vigorous st=c

396195

חֵ֔יל

subs power st=c

396196

מְלֶ֖אכֶת

subs work st=c

396197

עֲבֹודַ֥ת

subs work st=c

396198

subs house st=c

396199

art the

396200

אֱלֹהִֽים׃

subs god(s) st=a

phrase 32

1_Chronicles 8:40

889373

phrase 889373 Objc NP

395990

בָּנִים֙

subs son st=a

395991

conj and

395992

בְנֵ֣י

subs son st=c

395993

בָנִ֔ים

subs son st=a

phrase 33

Jeremiah 38:12

801694

phrase 801694 Objc NP

255927

בְּלֹואֵ֨י

subs waste st=c

255928

art the

255929

סְּחָבֹ֤ות

subs rags st=a

255930

conj and

255931

art the

255932

מְּלָחִים֙

subs rag st=a

phrase 34

Exodus 14:9

674072

phrase 674072 Subj NP

36386

subs whole st=c

36387

סוּס֙

subs horse st=c

36388

רֶ֣כֶב

subs chariot st=c

36389

פַּרְעֹ֔ה

subs pharaoh st=a

36390

conj and

36391

פָרָשָׁ֖יו

subs horseman st=a

36392

conj and

36393

חֵילֹ֑ו

subs power st=a

phrase 35

Leviticus 13:30

686550

phrase 686550 PreC NP

59522

צָרַ֧עַת

subs skin-disease st=c

59523

art the

59524

רֹ֛אשׁ

subs head st=a

59525

אֹ֥ו

conj or

59526

art the

59527

זָּקָ֖ן

subs beard st=a

phrase 36

Exodus 39:32

682355

phrase 682355 Subj NP

51629

subs whole st=c

51630

עֲבֹדַ֕ת

subs work st=c

51631

מִשְׁכַּ֖ן

subs dwelling-place st=c

51632

אֹ֣הֶל

subs tent st=c

51633

מֹועֵ֑ד

subs appointment st=a

phrase 37

Ezra 9:2

883150

phrase 883150 Subj NP

382326

יַ֧ד

subs hand st=c

382327

art the

382328

שָּׂרִ֣ים

subs chief st=a

382329

conj and

382330

art the

382331

סְּגָנִ֗ים

subs prefect st=a

phrase 38

2_Chronicles 31:5

902828

phrase 902828 Objc NP

422658

רֵאשִׁ֣ית

subs beginning st=c

422659

דָּגָ֗ן

subs corn st=a

422660

תִּירֹ֤ושׁ

subs wine st=a

422661

conj and

422662

יִצְהָר֙

subs oil st=a

422663

conj and

422664

דְבַ֔שׁ

subs honey st=a

422665

conj and

422666

כֹ֖ל

subs whole st=c

422667

תְּבוּאַ֣ת

subs yield st=c

422668

שָׂדֶ֑ה

subs open field st=a

phrase 39

Proverbs 21:21

865297

phrase 865297 Subj NP

352610

רֹ֭דֵף

subs pursue qal ptca st=c

352611

צְדָקָ֣ה

subs justice st=a

352612

conj and

352613

חָ֑סֶד

subs loyalty st=a

phrase 40

Zephaniah 1:14

830949

phrase 830949 Subj NP

303361

יֹום־

subs day st=c

303362

יְהוָה֙

nmpr YHWH st=a

303363

art the

303364

גָּדֹ֔ול

adjv great st=a

phrase 41

Numbers 24:17

700732

phrase 700732 Objc NP

85505

פַּאֲתֵ֣י

subs corner st=c

85506

מֹואָ֔ב

nmpr Moab st=a

85507

conj and

85508

קַרְקַ֖ר

subs <uncertain> st=c

85509

subs whole st=c

85510

בְּנֵי־

subs son st=c

85511

שֵֽׁת׃

subs defiance st=a

phrase 42

Daniel 8:13

879674

phrase 879674 Subj NP

375490

art the

375491

חָזֹ֤ון

subs vision st=a

375492

art the

375493

תָּמִיד֙

subs continuity st=a

375494

conj and

375495

art the

375496

פֶּ֣שַׁע

subs rebellion st=a

375497

שֹׁמֵ֔ם

adjv be desolate qal ptca st=a

Random Evaluations¶

We have examined methodically the different aspects of the selected heads, including their statistical distribution in terms of parts of speech, as well as looking for mixed or missed heads within quantifiers. Now we turn to a less methodical, but nonetheless important, evaluation measure: that of random sampling and manual inspection. The code below aims to produce the most varied sets of manual sampling, across a wide array of phrase types. Since some phrase types are statistically dominant, the code selects a random phrase type that is then used to select a random node of that type. The algorithm runs until producing 50 random samples, with no duplicates allowed. The sample set can be re-shuffled with new random results by running the first cell before displaying the results. There is also an option to pick results with more than a given number of heads, to show more complex examples.

In [75]:

# RUNNING THIS REMOVES PREVIOUSLY DISPLAYED RESULTS

min_words = 2
min_heads = 1

type2headresults = collections.defaultdict(list) # map types to result nodes
maxlength = collections.defaultdict(lambda: collections.defaultdict(int))
for phrase, heads in phrase2heads.items():
    typ = F.typ.v(phrase)
    type2headresults[typ].append((phrase,)+tuple(heads))
    phrase_len, head_len = len(L.d(phrase,'word')), len(heads)
    if phrase_len > maxlength[typ]['phrase']:
        maxlength[typ]['phrase'] = phrase_len
    if head_len > maxlength[typ]['head']:
        maxlength[typ]['head'] = head_len

samples = set() # to fill with 50
while len(samples) < 50:
        typ = random.choice(list(type2headresults.keys()))
        while maxlength[typ]['phrase'] < min_words or maxlength[typ]['head'] < min_heads: # don't pick type without size threshold
            typ = random.choice(list(type2headresults.keys()))
            
        choice = random.choice(type2headresults[typ])
        while len(L.d(choice[0], 'word')) < min_words or len(choice[1:]) < min_heads:
            choice = random.choice(type2headresults[typ])
        samples.add(choice)
        
samples = list(samples)

In [76]:

A.show(list(samples), condenseType='phrase', withNodes=True)

phrase 1

Isaiah 29:13

779941

phrase 779941 Conj CP

221578

יַ֚עַן

prep motive st=a

221579

כִּ֤י

conj that

phrase 2

Ecclesiastes 4:8

870782

phrase 870782 Subj DPrP

360703

advb even

360704

זֶ֥ה

prde this

phrase 3

Ezra 4:14

881936

phrase 881936 Conj CP

379785

כָּ

prep like

379786

ל־

prep to

379787

קֳבֵל֙

subs opposite st=c

379788

דִּֽי־

conj <relative>

phrase 4

Isaiah 48:14

785092

phrase 785092 Subj IPrP

228997

מִ֥י

prin who

phrase 785092 Subj IPrP|PP

228998

בָהֶ֖ם

prep in

phrase 5

Isaiah 28:29

779762

phrase 779762 Subj DPrP

221316

advb even

221317

זֹ֕את

prde this

phrase 6

Exodus 24:14

677549

phrase 677549 Conj CP

41883

עַ֥ד

prep unto

41884

אֲשֶׁר־

conj <relative>

phrase 7

Numbers 19:12

698605

phrase 698605 Time PP

82258

prep in

82259

art the

82260

יֹּ֧ום

subs day st=a

82261

art the

82262

שְּׁלִישִׁ֛י

adjv third st=a

phrase 8

Joshua 13:14

719794

phrase 719794 Conj CP

120660

כַּ

prep as

120661

אֲשֶׁ֖ר

conj <relative>

phrase 9

1_Samuel 25:14

741460

phrase 741460 Pred VP

156813

prep to

156814

בָרֵ֥ךְ

verb bless piel infc st=a

phrase 10

2_Samuel 19:23

751263

phrase 751263 Nega NegP

172544

הֲ

inrg <interrogative>

172545

לֹ֣וא

nega not

phrase 11

Joshua 16:6

720495

phrase 720495 Cmpl AdvP

122261

art the

122262

יָּ֗מָּה

subs sea st=a

phrase 720495 Cmpl AdvP|PrNP

122263

art the

122264

מִּכְמְתָת֙

nmpr Micmethath st=a

phrase 720495 Cmpl AdvP|PP

122265

prep from

122266

צָּפֹ֔ון

subs north st=a

phrase 12

2_Chronicles 18:23

899066

phrase 899066 PreC InrP

415438

אֵ֣י

inrg where

415439

זֶ֤ה

prde this

phrase 13

Nehemiah 13:22

887298

phrase 887298 Objc DPrP

391058

advb even

391059

זֹאת֙

prde this

phrase 14

Daniel 2:34

876919

phrase 876919 Conj CP

371216

עַ֠ד

prep until

371217

דִּ֣י

conj <relative>

phrase 15

Numbers 27:14

701466

phrase 701466 Conj CP

87150

כַּ

prep as

87151

אֲשֶׁר֩

conj <relative>

phrase 16

2_Kings 20:20

772451

phrase 772451 Cmpl AdvP

208652

art the

208653

עִ֑ירָה

subs town st=a

phrase 17

Psalms 145:5

852591

phrase 852591 Objc NP

335282

הֲ֭דַר

subs ornament st=c

335283

כְּבֹ֣וד

subs weight st=c

335284

הֹודֶ֑ךָ

subs splendour st=a

335285

conj and

335286

דִבְרֵ֖י

subs word st=c

335287

נִפְלְאֹותֶ֣יךָ

subs be miraculous nif ptca st=a

phrase 18

Leviticus 26:16

691577

phrase 691577 Subj PPrP

68387

אַף־

advb even

68388

אֲנִ֞י

prps i

phrase 19

Numbers 5:22

693784

phrase 693784 Intj InjP

73202

אָמֵ֥ן׀

intj surely

73203

intj surely

phrase 20

Psalms 72:19

843764

phrase 843764 Intj InjP

322521

אָ֘מֵ֥ן׀

intj surely

322522

conj and

322523

intj surely

phrase 21

Nehemiah 4:15

884715

phrase 884715 Pred VP

385357

prep from

385358

עֲלֹ֣ות

verb ascend qal infc st=c

phrase 22

Joshua 8:20

718173

phrase 718173 Cmpl AdvP

117314

art the

117315

שָּׁמַ֔יְמָה

subs heavens st=a

phrase 23

Deuteronomy 21:20

711177

phrase 711177 PreC AdjP

105211

סֹורֵ֣ר

adjv rebel qal ptca st=a

105212

conj and

105213

מֹרֶ֔ה

adjv rebel qal ptca st=a

phrase 24

2_Samuel 24:1

752997

phrase 752997 Pred VP

175424

לַ

prep to

175425

חֲרֹ֖ות

verb be hot qal infc st=a

phrase 25

1_Chronicles 28:2

894018

phrase 894018 Subj PrNP

405541

דָּוִ֤יד

nmpr David st=a

phrase 894018 Subj PrNP|NP

405542

art the

405543

מֶּ֨לֶךְ֙

subs king st=a

phrase 26

Psalms 41:14

840175

phrase 840175 Intj InjP

317373

אָ֘מֵ֥ן׀

intj surely

317374

conj and

317375

intj surely

phrase 27

Ezekiel 13:8

810058

phrase 810058 PreS VP

269980

יַ֚עַן

prep motive st=c

269981

דַּבֶּרְכֶ֣ם

verb speak piel infc st=a

phrase 28

Jeremiah 4:22

790605

phrase 790605 Pred VP

236940

prep to

236941

הֵיטִ֖יב

verb be good hif infc st=a

phrase 29

Deuteronomy 21:20

711181

phrase 711181 PreC AdjP

105218

זֹולֵ֖ל

adjv be lavish qal ptca st=a

105219

conj and

105220

סֹבֵֽא׃

adjv drink qal ptca st=a

phrase 30

Judges 4:21

724318

phrase 724318 Subj PrNP

129847

יָעֵ֣ל

nmpr Jael st=a

phrase 724318 Subj PrNP|NP

129848

אֵֽשֶׁת־

subs woman st=c

129849

חֶ֠בֶר

nmpr Heber st=a

phrase 31

Psalms 48:3

840787

phrase 840787 PreC AdjP

318307

יְפֵ֥ה

adjv beautiful st=c

318308

נֹוף֮

subs height st=a

phrase 32

Exodus 29:19

679001

phrase 679001 Subj PrNP

44902

אַהֲרֹ֧ן

nmpr Aaron st=a

44903

conj and

44904

בָנָ֛יו

subs son st=a

phrase 33

Jeremiah 48:47

804596

phrase 804596 Objc NP

261107

שְׁבוּת־

subs captivity st=c

261108

מֹואָ֛ב

nmpr Moab st=a

phrase 34

1_Chronicles 16:37

891336

phrase 891336 Cmpl AdvP

400081

שָׁ֗ם

advb there

phrase 891336 Cmpl AdvP|PP

400082

prep to

400083

פְנֵי֙

subs face st=c

400084

אֲרֹ֣ון

subs ark st=c

400085

בְּרִית־

subs covenant st=a

400086

יְהוָ֔ה

nmpr YHWH st=a

phrase 35

Psalms 35:21

839139

phrase 839139 Intj InjP

316012

הֶאָ֣ח׀

intj aha

316013

הֶאָ֑ח

intj aha

phrase 36

Ecclesiastes 2:15

870266

phrase 870266 Subj DPrP

359883

advb even

359884

זֶ֖ה

prde this

phrase 37

Haggai 2:3

831575

phrase 831575 Subj IPrP

304505

מִ֤י

prin who

phrase 831575 Subj IPrP|PP

304506

בָכֶם֙

prep in

phrase 38

Deuteronomy 33:13

715460

phrase 715460 PreC AdjP

112426

מְבֹרֶ֥כֶת

adjv bless pual ptcp st=c

112427

יְהֹוָ֖ה

nmpr YHWH st=a

phrase 39

Psalms 92:15

846646

phrase 846646 PreC AdjP

326695

דְּשֵׁנִ֖ים

adjv fat st=a

326696

conj and

326697

רַֽעֲנַנִּ֣ים

adjv luxuriant st=a

phrase 40

Isaiah 49:15

785392

phrase 785392 Subj DPrP

229415

advb even

229416

אֵ֣לֶּה

prde these

phrase 41

2_Chronicles 7:21

896504

phrase 896504 Subj NP

410543

art the

410544

בַּ֤יִת

subs house st=a

410545

art the

410546

זֶּה֙

prde this

phrase 42

Psalms 89:53

846326

phrase 846326 Intj InjP

326216

אָ֘מֵ֥ן׀

intj surely

326217

conj and

326218

intj surely

phrase 43

Malachi 2:9

834882

phrase 834882 Objc PP

309952

prep <object marker>

309953

דְּרָכַ֔י

subs way st=a

phrase 44

Joshua 10:11

718882

phrase 718882 Rela CP

118624

prep from

118625

אֲשֶׁ֥ר

conj <relative>

phrase 45

Judges 6:5

724723

phrase 724723 Subj PPrP

130505

הֵם֩

prps they

130506

conj and

130507

מִקְנֵיהֶ֨ם

subs purchase st=a

phrase 46

1_Chronicles 22:5

892832

phrase 892832 Pred VP

402595

prep to

402596

הַגְדִּ֨יל׀

verb be strong hif infc st=a

phrase 47

Judges 8:4

725774

phrase 725774 Cmpl AdvP

132211

art the

132212

יַּרְדֵּ֑נָה

nmpr Jordan st=a

phrase 48

2_Kings 23:27

773436

phrase 773436 Objc PP

210640

prep <object marker>

210641

יְר֣וּשָׁלִַ֔ם

nmpr Jerusalem st=a

210642

conj and

210643

prep <object marker>

210644

art the

210645

בַּ֔יִת

subs house st=a

phrase 49

2_Samuel 15:24

749395

phrase 749395 Subj NP

169458

subs whole st=c

169459

art the

169460

עָ֖ם

subs people st=a

phrase 50

Exodus 29:30

679120

phrase 679120 Time NP

45201

שִׁבְעַ֣ת

subs seven st=c

45202

יָמִ֗ים

subs day st=a

`obj_prep`¶

The feature prep_obj in v.1 was an edge feature from a word to its governing preposition. As is done with the nouns above, this would would be a nominal element that is disambiguated from its quantifiers. Since there is no dependency of a prepositional object, the nominal templates developed above can be used with the single change that the phrase type is a PP or CP (which also has prepositional objects!).

Since v.2 will encode edges from words to phrases rather than the other way around, this feature will encode an edge from the object to the preposition, hence the new feature name.

In [77]:

pp_obj_queries = {}
    
PP_noqant = f'''

phrase_atom
    prep prs=absent
    < head:nonquant pdp#conj|art|prep|nega

% either word is adjacent to prep
    /with/
    phrase_atom
        prep
        <: head
        
% or word is adjacent to prep but interrupted by article
    /or/
    phrase_atom
        prep
        <: word pdp=art
        <: head
    
    /or/

% or word is w1, an independent, non-modifying word
% what follows is a long description for that situation

    w1:word

% exclude w1 uses as modifier
    /without/
    subphrase rela=adj|atr|mod|dem
        w1
    /-/
    /with/
    = iword
    /-/

% exclude w1 rec relations to non-prepositions
    /without/
    nonprep
    <mother- subphrase rela=rec
        w1
    /-/

% ensure w1 is not immediately preceded by a construct form
    /without/
    phrase_atom
        nonprep st=c
        <: w1
    /-/
    

% exclude cases where word occurs in a subphrase immediately before a preposition
% only 1 case of this, but may be other edge cases this misses.
    /without/
    s1:subphrase
        prep
    s2:subphrase rela=par
        w1
        <: prep
    s1 <mother- s2
    /-/
    
    w1 = head
    /-/

'''

pp_obj_queries['PP_noqant'] = {'template': PP_noqant,
                               'prepi': 1,
                               'obji': 2}

PP_quant_alone = f'''

phrase_atom
    prep prs=absent
    < quantifier:quant

% quantifier does not precede a quantified element within a subphrase
    /without/
    subphrase
        quantifier
        < w1:nonquantprep pdp=subs|adjv|advb|nmpr|prde|prps
    
        /without/
        = postprep
        /-/
        
        /without/
        subphrase
        /without/
            quant
        /-/
            prep
            < w1
        /-/
    /-/ 
    

% quantifier not immediately adjacent to quantified element within a phrase_atom
    /without/
    phrase_atom
        quantifier
        <: w1:nonquantprep pdp=subs|nmpr|prde|prps
    /-/
    /without/
    phrase_atom
        quantifier
        <: word pdp=art
        <: w1:nonquantprep pdp=subs|nmpr|prde|prps
    /-/
    /without/
    phrase_atom
        w1:nonquantprep pdp=subs|nmpr|prde|prps
        <: quantifier
    /-/
    /without/
    phrase_atom
        w1:nonquantprep pdp=subs|nmpr|prde|prps
        <: word pdp=art
        <: quantifier
    /-/
        
% quantifier is not construct with quantified element
    /without/
    quantifier
    <mother- subphrase rela=rec
        nonquantprep pdp=subs|adjv|advb|nmpr|prde|prps
    /-/
    /without/
    phrase_atom
        quantifier st=c
        <: nonquantprep pdp=subs|adjv|advb|nmpr|prde|prps
    /-/
    
% quantifier is not in another relation with a quantified element
    /without/
    s1:subphrase
        quantifier
    s2:subphrase rela=adj|atr|dem
        w1:nonquantprep pdp=subs|adjv|advb|nmpr|prde|prps 
        /without/
        = postprep
        /-/
        /without/
        subphrase
        /without/
            quant
        /-/
            < prep
            w1
        /-/
        
    s1 <mother- s2
    /-/

% ensure quantifer is not in a quantifying chain
    /without/
    phrase_atom
    /with/
        s1:subphrase
            nonprep pdp=subs|adjv|nmpr|prde|prps lex#{quantlexs} ls#card
        s2:subphrase rela=adj|atr
            word ls=card
        s1 <mother- s2
    /or/
        s1:subphrase
            word ls=card
        s2:subphrase rela=adj|atr
            w1:nonprep pdp=subs|adjv|nmpr|prde|prps lex#{quantlexs} ls#card
            /without/
            = postprep
            /-/
            /without/
            subphrase
            /without/
                quant
            /-/
                < prep
                w1
            /-/
        s1 <mother- s2
    /-/
        quantifier ls=card prs=absent
    /-/

% exclude uses as modifier:
    /without/
    subphrase rela=adj|atr|rec|mod|dem
        quantifier
        w1:word
        /without/
        = postprep
        /-/
        quantifier = w1
    /-/
    /with/
    = iword
    /-/
'''

pp_obj_queries['PP_quant_alone'] = {'template': PP_quant_alone,
                                    'prepi': 1,
                                    'obji': 2}

PP_quantified = f'''


phrase_atom
    prep prs=absent
   
% ensure that word is quantified with a head-word quantifier
% NB: what follows is a long chain of specs on quantifier

    < quantifier:quant

    /with/
    phrase_atom
        prep
        <: quantifier
    /or/
    
% quantifier not used in rec relations to non-prepositions
    /without/
    nonprep
    <mother- subphrase rela=rec
        quantifier
        w1:word
        /without/
        phrase_atom
            prep
            <: w1
        /-/
        /without/
        phrase_atom
            prep
            <: word pdp=art
            <: w1
        /-/
        w1 = quantifier
    /-/

% quantifier not used in adj relations to non-quantifiers
    /without/
    subphrase
    /with/
        nonquant pdp#conj|art
    /-/
    <mother- subphrase rela=adj
        quantifier
        w1:word
        /without/
        prep
        <: w1
        /-/
        w1 = quantifier
    /-/
    /-/

% ------------------------------
% NB: what follows is a long chain of specs on head

% require adjacency to quantifier
    <1: subphrase
        head:nonquant pdp=subs|adjv|advb|nmpr|prde|prps
        
        /with/
        phrase_atom
            prep
            <: quant
            <: head

        /or/
        phrase_atom
            prep
            <: quant
            <: word pdp=art
            <: head
        
        /or/
    
% quantified word is not a dependent modifier
% exclude construct state to non quants/preps
        /without/
        nonquantprep st=c
        <: head
        /-/
        /without/
        nonquantprep st=c
        <: word pdp=art
        <: head
        /-/

% iword requirements
        /with/
        = iword
        /or/
        quant
        <: ..
        /or/
        quant
        <: word pdp=art
        <: ..
        /or/
        ..
        <: quant
        /-/

% exclude non-quant/prep rec relas
        /without/
        nonquantprep
        <mother- subphrase rela=rec
            head
        /-/
    
% exclude non-quant para rec relas
        /without/
        nonquantprep
        <mother- subphrase rela=rec
        <mother- subphrase rela=par
            head
        /-/
        
% exclude non-quant adjunct relas
        /without/
        subphrase
        /without/
            := quant
        /-/
        <mother- subphrase rela=adj
            head
        /-/
    
% exclude non-quant para adjunct relas
        /without/
        subphrase
        /without/
            := quant
        /-/
        <mother- subphrase rela=adj
        <mother- subphrase rela=par
            head
        /-/

% exclude demonstrative relas when demonstrative points to subphrase with words other than quantifiers
        /without/
        subphrase
        /with/
            nonquant pdp#art|conj
        /-/
        <mother- subphrase rela=dem
            head 
        /-/

% exclude all other kinds of relations
        /without/
        subphrase rela=atr|mod
            head
        /-/
        /-/
'''

pp_obj_queries['PP_quantified'] = {'template': PP_quantified,
                                   'prepi': 1,
                                   'obji': 4}
    
special_quantified = '''

% necessary due to technical limitation in search patterns
phrase_atom
    prep
    <: quant
    <: nonquant pdp=subs|adjv|advb|nmpr|prde|prps
'''
pp_obj_queries['special_quantified'] = {'template': special_quantified,
                                        'prepi': 1,
                                        'obji': 3}
    
PP_to_PP = '''

phrase_atom
    prep
    <: prep
'''

pp_obj_queries['PP_to_PP'] = {'template': PP_to_PP,
                              'prepi': 1,
                              'obji': 2}
    
PP_to_conj = '''

phrase_atom
    prep
    <: word pdp=conj
    
    /with/
    phrase_atom typ=CP
        ..
    /or/
    lex=C|>CR
    /-/
'''

pp_obj_queries['PP_to_conj'] = {'template': PP_to_conj,
                                'prepi': 1,
                                'obji': 2}

PP_negation = '''

pa:phrase_atom
    pp:prep
    neg:word pdp=nega

pa =: pp
pa := neg
pp # neg
'''
pp_obj_queries['PP_negation'] = {'template': PP_negation,
                                'prepi': 1,
                                'obji': 2}



obj2prep = collections.defaultdict()
prep2obj = collections.defaultdict(set)

for name, query in pp_obj_queries.items():
    template = query['template']
    prepi = query['prepi']
    obji = query['obji']
    
    print(f'running query on {name}...')
    results = A.search(template, sets=sets)

    print('\tprocessing prepositions...')
    for res in results:
        obj = res[obji]
        # back up one slot until a preposition is found
        prep = None
        cur_slot = obj
        while not prep:
            cur_slot -= 1
            if cur_slot in preps:
                prep = cur_slot
                
        obj2prep[obj] = prep
        prep2obj[prep].add(obj)
        
print('\n', '<>'*20, '\n')
print(f'queries complete with {len(obj2prep)} object of preposition mappings...')

running query on PP_noqant...
  1.87s 66077 results
	processing prepositions...
running query on PP_quant_alone...
  0.12s 829 results
	processing prepositions...
running query on PP_quantified...
  1.50s 5142 results
	processing prepositions...
running query on special_quantified...
  0.62s 1959 results
	processing prepositions...
running query on PP_to_PP...
  0.22s 2917 results
	processing prepositions...
running query on PP_to_conj...
  0.54s 1055 results
	processing prepositions...
running query on PP_negation...
  0.62s 1 result
	processing prepositions...

 <><><><><><><><><><><><><><><><><><><><> 

queries complete with 64217 object of preposition mappings...

Check for Missing Prepositional Objects¶

In [78]:

# for testing:
for sp in L.d(0, 'subphrase'):
    print(sp, F.rela.v(sp), E.mother.f(sp), T.text(sp))
    print(L.d(sp, 'word'))
    print()

In [79]:

simple_check_results = A.search('''

phrase_atom
    prep prs=absent
    <: word pdp#conj
    
''', sets={'prep': preps})

simple_check_prep = [res for res in simple_check_results if res[1] not in prep2obj]

print(f'{len(simple_check_prep)} prepositions missing...')

A.show(simple_check_prep, withNodes=True, condenseType='phrase_atom', end=100)

  1.07s 62424 results
0 prepositions missing...

Random Inspection¶

In [80]:

random_results = [(L.u(obj, 'phrase_atom')[0], prep, obj) for obj, prep in obj2prep.items()
                     if len(prep2obj[prep]) > 1
                 ]
random.shuffle(random_results)
random_results = [res for res in random_results 
                      #if len(L.d(res[0], 'word')) > 5
                 ]

random_results.insert(0, (898847,)+tuple(prep2obj[898847])) # <- suspect
len(random_results)

Out[80]:

In [81]:

for i in range(0, 50):
    
    res = random_results[i]
    show = {}
    
    for word in L.d(res[0], 'word'):
        if word in obj2prep:
            
            show[word] = 'pink'
            show[obj2prep[word]] = 'lightblue'

    result = (res[0],) + tuple(show.keys())
                    
    A.prettyTuple(result, condenseType='phrase_atom', withNodes=True, seqNumber=i, highlights=show)

Result 0

2_Chronicles 18:9

898847

2_Chronicles 18:9

phrase 898847 Cmpl PP

415125

prep in

415126

גֹ֔רֶן

subs threshing-floor st=a

415127

פֶּ֖תַח

subs opening st=c

415128

subs gate st=c

415129

שֹׁמְרֹ֑ון

nmpr Samaria st=a

Result 1

2_Chronicles 32:28

phrase 903348 Adju PP

423752

prep to

423753

subs whole st=c

423754

בְּהֵמָ֣ה

subs cattle st=a

423755

conj and

423756

בְהֵמָ֔ה

subs cattle st=a

Result 2

Ezra 7:7

phrase 882611 Subj PP

381025

prep from

381026

subs son st=c

381027

יִ֠שְׂרָאֵל

nmpr Israel st=a

381028

conj and

381029

prep from

381030

art the

381031

כֹּהֲנִ֨ים

subs priest st=a

381032

conj and

381033

art the

381034

לְוִיִּ֜ם

subs Levite st=a

381035

conj and

381036

art the

381037

מְשֹׁרְרִ֧ים

subs sing piel ptca st=a

381038

conj and

381039

art the

381040

שֹּׁעֲרִ֛ים

subs porter st=a

381041

conj and

381042

art the

381043

נְּתִינִ֖ים

subs temple slave st=a

Result 3

Esther 8:11

phrase 875744 PreC PP

368985

prep in

368986

subs whole st=c

368987

עִיר־

subs town st=a

368988

conj and

368989

עִ֗יר

subs town st=a

Result 4

Isaiah 66:2

phrase 789022 Cmpl PP

234497

prep to

234498

עָנִי֙

subs humble st=a

234499

conj and

234500

נְכֵה־

adjv smitten st=c

234501

ר֔וּחַ

subs wind st=a

234502

conj and

234503

חָרֵ֖ד

subs trembling st=a

Result 5

Psalms 91:13

phrase 846531 Cmpl PP

326521

prep upon

326522

שַׁ֣חַל

subs young lion st=a

326523

conj and

326524

פֶ֣תֶן

subs cobra st=a

Result 6

Ezekiel 9:9

phrase 809020 Adju PP

268313בִּ
 prep in
268314מְאֹ֣ד 
 subs might st=a
268315מְאֹ֔ד 
 subs might st=a

Result 7

Exodus 33:2

phrase 680324 Objc PP

47310

prep <object marker>

47311

art the

47312

כְּנַעֲנִי֙

subs Canaanite st=a

47313

art the

47314

אֱמֹרִ֔י

subs Amorite st=a

47315

conj and

47316

art the

47317

חִתִּי֙

subs Hittite st=a

47318

conj and

47319

art the

47320

פְּרִזִּ֔י

subs Perizzite st=a

47321

art the

47322

חִוִּ֖י

subs Hivite st=a

47323

conj and

47324

art the

47325

יְבוּסִֽי׃

subs Jebusite st=a

Result 8

Nehemiah 6:1

phrase 885019 Cmpl PP

385900

prep to

385901

סַנְבַלַּ֣ט

nmpr Sanballat st=a

385902

וְ֠

conj and

385903

טֹובִיָּה

nmpr Tobijah st=a

385904

conj and

385905

prep to

385906

גֶ֨שֶׁם

nmpr Geshem st=a

Result 9

Genesis 32:8

phrase 662366 Objc PP

17576

prep <object marker>

17577

art the

17578

צֹּ֧אן

subs cattle st=a

17579

conj and

17580

prep <object marker>

17581

art the

17582

בָּקָ֛ר

subs cattle st=a

17583

conj and

17584

art the

17585

גְּמַלִּ֖ים

subs camel st=a

Result 10

Psalms 150:4

phrase 853039 Adju PP

336002

prep in

336003

מִנִּ֥ים

subs string st=a

336004

conj and

336005

עוּגָֽב׃

subs flute st=a

Result 11

Ezekiel 35:7

phrase 818024 Cmpl PP

282262

prep to

282263

שִֽׁמְמָ֖ה

subs <uncertain> st=a

282264

conj and

282265

שְׁמָמָ֑ה

subs desolation st=a

Result 12

2_Chronicles 14:2

phrase 898014 Objc PP

413511

prep <object marker>

413512

מִזְבְּחֹ֥ות

subs altar st=c

413513

art the

413514

נֵּכָ֖ר

subs foreigner st=a

413515

conj and

413516

art the

413517

בָּמֹ֑ות

subs high place st=a

Result 13

2_Chronicles 31:19

phrase 902957 PreC PP

423003

prep in

423004

subs whole st=c

423005

עִ֣יר

subs town st=a

423006

conj and

423007

עִ֔יר

subs town st=a

Result 14

1_Samuel 17:36

phrase 738018 Objc PP

151573

גַּ֧ם

advb even

151574

prep <object marker>

151575

art the

151576

אֲרִ֛י

subs lion st=a

151577

advb even

151578

art the

151579

דֹּ֖וב

subs bear st=a

Result 15

Joshua 17:11

phrase 720659 Subj PrNP|PP

122647

prep <object marker>

122648

יֹשְׁבֵ֧י

subs sit qal ptca st=c

122649

דֹ֣אר

nmpr Dor st=a

122650

conj and

122651

בְנֹותֶ֗יהָ

subs daughter st=a

122652

conj and

122653

יֹשְׁבֵ֤י

subs sit qal ptca st=c

122654

עֵֽין־דֹּר֙

nmpr Endor st=a

122655

conj and

122656

בְנֹתֶ֔יהָ

subs daughter st=a

122657

conj and

122658

יֹשְׁבֵ֤י

subs sit qal ptca st=c

122659

תַעְנַךְ֙

nmpr Taanach st=a

122660

conj and

122661

בְנֹתֶ֔יהָ

subs daughter st=a

122662

conj and

122663

יֹשְׁבֵ֥י

subs sit qal ptca st=c

122664

מְגִדֹּ֖ו

nmpr Megiddo st=a

122665

conj and

122666

בְנֹותֶ֑יהָ

subs daughter st=a

Result 16

Deuteronomy 3:10

phrase 705255 Adju NP|PP

94689

prep unto

94690

סַלְכָ֖ה

nmpr Salecah st=a

94691

conj and

94692

אֶדְרֶ֑עִי

nmpr Edrei st=a

Result 17

2_Kings 13:23

phrase 769651 Adju PP

203569

prep together with

203570

אַבְרָהָ֖ם

nmpr Abraham st=a

203571

יִצְחָ֣ק

nmpr Isaac st=a

203572

conj and

203573

יַֽעֲקֹ֑ב

nmpr Jacob st=a

Result 18

Judges 8:26

phrase 726056 Adju PP

132686

prep from

132687

art the

132688

שַּׂהֲרֹנִ֨ים

subs <ornament> st=a

132689

conj and

132690

art the

132691

נְּטִפֹ֜ות

subs eardrops st=a

132692

conj and

132693

בִגְדֵ֣י

subs garment st=c

132694

art the

132695

אַרְגָּמָ֗ן

subs purple-wool st=a

Result 19

Ezra 8:26

phrase 883046 Objc NP

382010

כְלֵי־

subs tool st=c

382011

כֶ֥סֶף

subs silver st=a

382012

מֵאָ֖ה

subs hundred st=a

382013

prep to

382014

כִכָּרִ֑ים

subs disk st=a

382015

זָהָ֖ב

subs gold st=a

382016

מֵאָ֥ה

subs hundred st=a

382017

כִכָּֽר׃

subs disk st=a

Result 20

Nehemiah 4:7

phrase 884612 Adju PP

385158

עִם־

prep with

385159

חַרְבֹתֵיהֶ֛ם

subs dagger st=a

385160

רָמְחֵיהֶ֖ם

subs lance st=a

385161

conj and

385162

קַשְּׁתֹתֵיהֶֽם׃

subs bow st=a

Result 21

Numbers 30:5

phrase 702034 Objc PP

88618

prep <object marker>

88619

נִדְרָ֗הּ

subs vow st=a

88620

וֶֽ

conj and

88621

אֱסָרָהּ֙

subs obligation st=a

Result 22

Deuteronomy 31:28

phrase 714754 Objc PP

111365

prep <object marker>

111366

subs whole st=c

111367

זִקְנֵ֥י

subs old st=c

111368

שִׁבְטֵיכֶ֖ם

subs rod st=a

111369

conj and

111370

שֹׁטְרֵיכֶ֑ם

subs register qal ptca st=a

Result 23

2_Chronicles 15:9

phrase 898271 Objc PP

413983

prep <object marker>

413984

subs whole st=c

413985

יְהוּדָה֙

nmpr Judah st=a

413986

conj and

413987

בִנְיָמִ֔ן

nmpr Benjamin st=a

413988

conj and

413989

art the

413990

גָּרִים֙

subs dwell qal ptca st=a

Result 24

1_Kings 16:13

phrase 760562 Adju PP

189314

אֶ֚ל

prep to

189315

subs whole st=c

189316

חַטֹּ֣אות

subs sin st=c

189317

בַּעְשָׁ֔א

nmpr Baasha st=a

189318

conj and

189319

חַטֹּ֖אות

subs sin st=c

189320

אֵלָ֣ה

nmpr Elah st=a

Result 25

1_Samuel 4:21

phrase 733119 Adju PP

143808

prep to

143809

חָמִ֖יהָ

subs father-in-law st=a

143810

conj and

143811

אִישָֽׁהּ׃

subs man st=a

Result 26

1_Kings 16:23

phrase 760685 Time PP

189546

בִּ

prep in

189547

שְׁנַת֩

subs year st=c

189548

שְׁלֹשִׁ֨ים

subs three st=a

189549

conj and

189550

אַחַ֜ת

subs one st=c

189551

שָׁנָ֗ה

subs year st=a

Result 27

Exodus 33:2

phrase 680324 Objc PP

47310

prep <object marker>

47311

art the

47312

כְּנַעֲנִי֙

subs Canaanite st=a

47313

art the

47314

אֱמֹרִ֔י

subs Amorite st=a

47315

conj and

47316

art the

47317

חִתִּי֙

subs Hittite st=a

47318

conj and

47319

art the

47320

פְּרִזִּ֔י

subs Perizzite st=a

47321

art the

47322

חִוִּ֖י

subs Hivite st=a

47323

conj and

47324

art the

47325

יְבוּסִֽי׃

subs Jebusite st=a

Result 28

1_Chronicles 4:28

phrase 888325 Cmpl PP

393225

בִּ

prep in

393226

בְאֵֽר־

subs well st=c

393227

שֶׁ֥בַע

nmpr Sheba st=a

393228

conj and

393229

מֹולָדָ֖ה

nmpr Moladah st=a

393230

conj and

393231

חֲצַ֥ר שׁוּעָֽל׃

nmpr Hazar Shual st=a

Result 29

Hosea 3:2

phrase 822967 Adju PP

291615

prep in

291616

חֲמִשָּׁ֥ה

subs five st=a

291617

subs -teen st=a

291618

כָּ֑סֶף

subs silver st=a

291619

conj and

291620

חֹ֥מֶר

subs homer st=c

291621

שְׂעֹרִ֖ים

subs barley st=a

291622

conj and

291623

לֵ֥תֶךְ

subs letek st=c

291624

שְׂעֹרִֽים׃

subs barley st=a

Result 30

Esther 4:5

phrase 874773 Adju PP

367247עַל־
 prep upon
367248מַה־
 prin what
367249זֶּֽה׃ 
 prde this

Result 31

Joshua 1:4

phrase 715766 Cmpl PP

112987

prep from

112988

art the

112989

מִּדְבָּר֩

subs desert st=a

112990

conj and

112991

art the

112992

לְּבָנֹ֨ון

nmpr Lebanon st=a

112993

art the

112994

זֶּ֜ה

prde this

112995

וְֽ

conj and

112996

prep unto

112997

art the

112998

נָּהָ֧ר

subs stream st=a

112999

art the

113000

גָּדֹ֣ול

adjv great st=a

Result 32

2_Chronicles 36:10

phrase 904594 Cmpl PP

426259

prep upon

426260

יְהוּדָ֖ה

nmpr Judah st=a

426261

conj and

426262

ירוּשָׁלִָֽם׃ פ

nmpr Jerusalem st=a

Result 33

Isaiah 57:15

phrase 787114 Cmpl PP

231803

prep together with

231804

דַּכָּא֙

subs crushed st=a

231805

conj and

231806

שְׁפַל־

adjv low st=c

231807

ר֔וּחַ

subs wind st=a

Result 34

Ezra 8:29

phrase 883065 Adju PP

382062

prep to

382063

פְנֵי֩

subs face st=c

382064

שָׂרֵ֨י

subs chief st=c

382065

art the

382066

כֹּהֲנִ֧ים

subs priest st=a

382067

conj and

382068

art the

382069

לְוִיִּ֛ם

subs Levite st=a

382070

conj and

382071

שָׂרֵֽי־

subs chief st=c

382072

art the

382073

אָבֹ֥ות

subs father st=a

Result 35

Joshua 8:17

phrase 718112 Cmpl PP

117221

בָּ

prep in

117222

art the

117223

עַי֙

nmpr Ai st=a

117224

conj and

117225

בֵ֣ית אֵ֔ל

nmpr Bethel st=a

Result 36

Jeremiah 52:31

phrase 806681 Time PP

264659

prep in

264660

subs twenty st=a

264661

conj and

264662

חֲמִשָּׁ֖ה

subs five st=a

Result 37

Ezra 2:59

phrase 881541 Cmpl PP

378886מִ
 prep from
378887תֵּ֥ל מֶ֨לַח֙ 
 nmpr Tel Melah st=a
378888תֵּ֣ל חַרְשָׁ֔א 
 nmpr Tel Harsha st=a
378889כְּר֥וּב 
 nmpr Kerub st=a
378890אַדָּ֖ן 
 nmpr Addon st=a
378891אִמֵּ֑ר 
 nmpr Immer st=a

Result 38

1_Kings 10:29

phrase 758021 Adju PP

184998

prep in

184999

חֲמִשִּׁ֣ים

subs five st=a

185000

conj and

185001

מֵאָ֑ה

subs hundred st=a

Result 39

Isaiah 35:7

phrase 781319 PreC PP

223671

prep to

223672

קָנֶ֥ה

subs reed st=a

223673

conj and

223674

גֹֽמֶא׃

subs papyrus st=a

Result 40

Psalms 98:6

phrase 847141 Adju PP

327418

בַּ֭

prep in

327419

חֲצֹ֣צְרֹות

subs clarion st=a

327420

conj and

327421

קֹ֣ול

subs sound st=c

327422

שֹׁופָ֑ר

subs horn st=a

Result 41

Genesis 19:25

phrase 656683 Objc PP

8970

אֵת֙

prep <object marker>

8971

subs whole st=c

8972

יֹשְׁבֵ֣י

subs sit qal ptca st=c

8973

art the

8974

עָרִ֔ים

subs town st=a

8975

conj and

8976

צֶ֖מַח

subs sprout st=c

8977

art the

8978

אֲדָמָֽה׃

subs soil st=a

Result 42

Isaiah 40:17

phrase 782655 Cmpl PP

225832

prep from

225833

אֶ֥פֶס

subs end st=a

225834

conj and

225835

תֹ֖הוּ

subs emptiness st=a

Result 43

Isaiah 35:7

phrase 781319 PreC PP

223671

prep to

223672

קָנֶ֥ה

subs reed st=a

223673

conj and

223674

גֹֽמֶא׃

subs papyrus st=a

Result 44

Proverbs 16:6

phrase 864250 Cmpl PP

351062

prep in

351063

חֶ֣סֶד

subs loyalty st=a

351064

וֶ֭

conj and

351065

אֱמֶת

subs trustworthiness st=a

Result 45

Ezra 8:17

phrase 882960 Cmpl PP

381801

prep to

381802

אִדֹּ֨ו

nmpr Iddo st=a

381803

אָחִ֤יו

subs brother st=a

381804

art the

381805

נְּתִינִים֙

subs temple slave st=a

Result 46

Isaiah 2:12

phrase 774538 Cmpl PP

212739

עַ֥ל

prep upon

212740

subs whole st=c

212741

גֵּאֶ֖ה

subs haughty st=a

212742

conj and

212743

רָ֑ם

subs be high qal ptca st=a

Result 47

2_Chronicles 30:11

phrase 902577 Subj NP|PP

422105

prep from

422106

אָשֵׁ֥ר

nmpr Asher st=a

422107

conj and

422108

מְנַשֶּׁ֖ה

nmpr Manasseh st=a

422109

conj and

422110

prep from

422111

זְּבֻל֑וּן

nmpr Zebulun st=a

Result 48

Nehemiah 10:32

phrase 886486 Objc PP

388959

prep <object marker>

388960

art the

388961

שָּׁנָ֥ה

subs year st=a

388962

art the

388963

שְּׁבִיעִ֖ית

adjv seventh st=a

388964

conj and

388965

מַשָּׁ֥א

subs claim st=c

388966

subs whole st=c

388967

יָֽד׃

subs hand st=a

Result 49

Joshua 8:35

phrase 718384 Cmpl PP

117724

נֶ֣גֶד

prep counterpart st=c

117725

subs whole st=c

117726

קְהַ֤ל

subs assembly st=c

117727

יִשְׂרָאֵל֙

nmpr Israel st=a

117728

conj and

117729

art the

117730

נָּשִׁ֣ים

subs woman st=a

117731

conj and

117732

art the

117733

טַּ֔ף

subs <those unable to march> st=a

117734

conj and

117735

art the

117736

גֵּ֖ר

subs sojourner st=a

`nheads`¶

In many cases one does not want to go through prepositions to reach the nominal head elements (i.e. independent substantive, adjective, etc.) in a phrase. For this we can export an additional feature, called nheads ("nominal heads"), which simply ignores any prepositions and selects the nominal elements from the phrase and phrase atoms. This feature is built up using the phrase2heads and prep2obj features already calculated above.

Note on `AdjP`¶

This feature does not select nominals that are embedded within an adjective phrase (AdjP), but those can be selected with the following pattern:

In [82]:

adj_nhead = '''

phrase_atom typ=AdjP
    w1:word 
    /with/
    word pdp=adjv
    <mother- subphrase rela=rec
        w1
    /-/

'''

A.show(A.search(adj_nhead), condenseType='phrase_atom', end=5)

  1.08s 201 results

phrase_atom 1

Genesis 22:12

phrase 657686 PreC AdjP

10502

יְרֵ֤א

adjv afraid st=c

10503

אֱלֹהִים֙

subs god(s) st=a

phrase_atom 2

Genesis 24:16

phrase 658227 PreC AdjP

11445

טֹבַ֤ת

adjv good st=c

11446

מַרְאֶה֙

subs sight st=a

11447

מְאֹ֔ד

advb might st=a

phrase_atom 3

Genesis 26:7

phrase 659299 PreC AdjP

13120

טֹובַ֥ת

adjv good st=c

13121

מַרְאֶ֖ה

subs sight st=a

phrase_atom 4

Genesis 29:17

phrase 660805 PreC AdjP

15276

יְפַת־

adjv beautiful st=c

15277

תֹּ֖אַר

subs form st=a

15278

וִ

conj and

15279

יפַ֥ת

adjv beautiful st=c

15280

מַרְאֶֽה׃

subs sight st=a

phrase_atom 5

Genesis 29:17

phrase 660805 PreC AdjP

15276

יְפַת־

adjv beautiful st=c

15277

תֹּ֖אַר

subs form st=a

15278

וִ

conj and

15279

יפַ֥ת

adjv beautiful st=c

15280

מַרְאֶֽה׃

subs sight st=a

In [83]:

def find_prep_nominal(preposition, nominals=[]):
    '''
    This function recursively
    moves through prepositional
    chains to obtain the ultimate 
    governed nominal element.
    '''
    objects = prep2obj.get(preposition, None)
    if objects:
        for obj in objects:
            if obj not in sets['prep']:
                nominals.append(obj)
            else:
                find_prep_nominal(obj, nominals=nominals)

In [84]:

nheads = collections.defaultdict(set)

for phrase, heads in phrase2heads.items():
    for head in heads:
        if head not in sets['prep']:
            nheads[phrase].add(head)
        else:
            nominals = []
            find_prep_nominal(head, nominals=nominals)
            if nominals:
                nheads[phrase] |= set(nominals)
            
print(f'{len(nheads)} nheads assigned...')
print(f'{len(phrase2heads)-len(nheads)} phrases not assigned an nhead...')

240412 nheads assigned...
12795 phrases not assigned an nhead...

In [85]:

examples = [(phrase,)+tuple(heads) for phrase, heads in nheads.items()
               #if F.typ.v(phrase) == 'PP'
                if len(heads) > 1
           ]

random.shuffle(examples)

In [86]:

for res in examples[:50]:
    A.prettyTuple(res, condenseType='phrase', withNodes=True, seqNumber=res[0])

Result 698662

Numbers 19:16

698662

phrase 698662 Cmpl PP

82350

בַּֽ

prep in

82351

חֲלַל־

adjv pierced st=c

82352

חֶ֨רֶב֙

subs dagger st=a

82353

conj or

82354

prep in

82355

מֵ֔ת

subs die qal ptca st=a

82356

אֹֽו־

conj or

82357

prep in

82358

עֶ֥צֶם

subs bone st=c

82359

אָדָ֖ם

subs human, mankind st=a

82360

conj or

82361

prep in

82362

קָ֑בֶר

subs grave st=a

Result 655979

Genesis 18:8

655979

phrase 655979 Objc NP

7934

חֶמְאָ֜ה

subs butter st=a

7935

conj and

7936

חָלָ֗ב

subs milk st=a

7937

conj and

7938

בֶן־

subs son st=c

7939

art the

7940

בָּקָר֙

subs cattle st=a

Result 864878

Proverbs 19:14

864878

phrase 864878 Subj NP

351974

בַּ֣יִת

subs house st=a

351975

וָ֭

conj and

351976

הֹון

subs abundance st=a

Result 721872

Joshua 22:11

721872

phrase 721872 Subj NP

125528

בְנֵֽי־

subs son st=c

125529

רְאוּבֵ֣ן

nmpr Reuben st=a

125530

conj and

125531

בְנֵי־

subs son st=c

125532

גָ֡ד

nmpr Gad st=a

125533

conj and

125534

חֲצִי֩

subs half st=c

125535

שֵׁ֨בֶט

subs rod st=c

125536

art the

125537

מְנַשֶּׁ֜ה

nmpr Manasseh st=a

Result 883894

Nehemiah 2:4

883894

phrase 883894 Adju PP

383796

prep upon

383797

מַה־

prin what

383798

זֶּ֖ה

prde this

Result 701876

Numbers 29:13

701876

phrase 701876 Objc NP

88096

פָּרִ֧ים

subs young bull st=a

88097

בְּנֵי־

subs son st=c

88098

בָקָ֛ר

subs cattle st=a

88099

שְׁלֹשָׁ֥ה

subs three st=a

88100

subs -teen st=a

88101

אֵילִ֣ם

subs ram, despot st=a

88102

שְׁנָ֑יִם

subs two st=a

88103

כְּבָשִׂ֧ים

subs young ram st=a

88104

subs son st=c

88105

שָׁנָ֛ה

subs year st=a

88106

אַרְבָּעָ֥ה

subs four st=a

88107

subs -teen st=a

Result 875726

Esther 8:9

875726

phrase 875726 Adju PP

368941

כִּ

prep as

368942

כְתָבָ֖ם

subs writing st=a

368943

conj and

368944

כִ

prep as

368945

לְשֹׁונָֽם׃

subs tongue st=a

Result 693094

Numbers 3:48

693094

phrase 693094 Cmpl PP

71706

prep to

71707

אַהֲרֹ֖ן

nmpr Aaron st=a

71708

conj and

71709

prep to

71710

בָנָ֑יו

subs son st=a

Result 812114

Ezekiel 18:19

812114

phrase 812114 Objc NP

273036

מִשְׁפָּ֧ט

subs justice st=a

273037

conj and

273038

צְדָקָ֣ה

subs justice st=a

Result 901662

2_Chronicles 27:7

901662

phrase 901662 Frnt NP

420279

יֶתֶר֙

subs remainder st=c

420280

דִּבְרֵ֣י

subs word st=c

420281

יֹותָ֔ם

nmpr Jotham st=a

420282

conj and

420283

subs whole st=c

420284

מִלְחֲמֹתָ֖יו

subs war st=a

420285

conj and

420286

דְרָכָ֑יו

subs way st=a

Result 707604

Deuteronomy 9:22

707604

phrase 707604 Loca PP

98946

prep in

98947

תַבְעֵרָה֙

nmpr Taberah st=a

98948

conj and

98949

prep in

98950

מַסָּ֔ה

nmpr Massah st=a

98951

conj and

98952

prep in

98953

קִבְרֹ֖ת הַֽתַּאֲוָ֑ה

nmpr Kibroth Hattaavah st=a

Result 672141

Exodus 8:27

672141

phrase 672141 Cmpl PP

32994

prep from

32995

פַּרְעֹ֖ה

subs pharaoh st=a

32996

prep from

32997

עֲבָדָ֣יו

subs servant st=a

32998

conj and

32999

prep from

33000

עַמֹּ֑ו

subs people st=a

Result 840537

Psalms 45:4

840537

phrase 840537 Objc NP

317918

הֹ֝ודְךָ֗

subs splendour st=a

317919

conj and

317920

הֲדָרֶֽךָ׃

subs ornament st=a

Result 698440

Numbers 18:30

698440

phrase 698440 Adju PP

81961

כִּ

prep as

81962

תְבוּאַ֥ת

subs yield st=c

81963

גֹּ֖רֶן

subs threshing-floor st=a

81964

conj and

81965

כִ

prep as

81966

תְבוּאַ֥ת

subs yield st=c

81967

יָֽקֶב׃

subs pit st=a

Result 701931

Numbers 29:26

701931

phrase 701931 Objc NP

88322

פָּרִ֥ים

subs young bull st=a

88323

תִּשְׁעָ֖ה

subs nine st=a

88324

אֵילִ֣ם

subs ram, despot st=a

88325

שְׁנָ֑יִם

subs two st=a

88326

כְּבָשִׂ֧ים

subs young ram st=a

88327

subs son st=c

88328

שָׁנָ֛ה

subs year st=a

88329

אַרְבָּעָ֥ה

subs four st=a

88330

subs -teen st=a

88331

תְּמִימִֽם׃

adjv complete st=a

Result 898142

2_Chronicles 14:12

898142

phrase 898142 Subj PrNP

413746

אָסָ֜א

nmpr Asa st=a

413747

conj and

413748

art the

413749

עָ֣ם

subs people st=a

Result 820857

Ezekiel 43:15

820857

phrase 820857 Adju PP

287472

prep from

287473

art the

287474

אֲרִיאֵ֣ל

subs fire-place st=a

287475

conj and

287476

prep to

287477

מַ֔עְלָה

subs top st=a

Result 831558

Haggai 1:14

831558

phrase 831558 Objc PP

304409

prep <object marker>

304410

רוּחַ֩

subs wind st=c

304411

זְרֻבָּבֶ֨ל

nmpr Zerubbabel st=a

phrase 831558 Objc PP|NP

304412

בֶּן־

subs son st=c

304413

שַׁלְתִּיאֵ֜ל

nmpr Shealtiel st=a

phrase 831558 Objc PP|NP

304414

פַּחַ֣ת

subs governor st=c

304415

יְהוּדָ֗ה

nmpr Judah st=a

phrase 831558 Objc PP|CP

304416

conj and

phrase 831558 Objc PP

304417

prep <object marker>

304418

ר֨וּחַ֙

subs wind st=c

304419

יְהֹושֻׁ֤עַ

nmpr Joshua st=a

phrase 831558 Objc PP|NP

304420

בֶּן־

subs son st=c

304421

יְהֹוצָדָק֙

nmpr Jehozadak st=a

phrase 831558 Objc PP|NP

304422

art the

304423

כֹּהֵ֣ן

subs priest st=a

304424

art the

304425

גָּדֹ֔ול

adjv great st=a

phrase 831558 Objc PP|CP

304426

וְֽ

conj and

phrase 831558 Objc PP

304427

prep <object marker>

304428

ר֔וּחַ

subs wind st=c

304429

כֹּ֖ל

subs whole st=c

304430

שְׁאֵרִ֣ית

subs rest st=c

304431

art the

304432

עָ֑ם

subs people st=a

Result 724693

Judges 6:2

724693

phrase 724693 Objc PP

130449

prep <object marker>

130450

art the

130451

מִּנְהָרֹות֙

subs store st=a

phrase 724693 Objc PP|CP

130456

conj and

phrase 724693 Objc PP

130457

prep <object marker>

130458

art the

130459

מְּעָרֹ֖ות

subs cave st=a

130460

conj and

130461

prep <object marker>

130462

art the

130463

מְּצָדֹֽות׃

subs unapproachable st=a

Result 736026

1_Samuel 14:6

736026

phrase 736026 Cmpl PP

148468

prep in

148469

רַ֖ב

subs much st=a

148470

אֹ֥ו

conj or

148471

בִ

prep in

148472

מְעָֽט׃

subs little st=a

Result 669497

Genesis 50:24

669497

phrase 669497 Cmpl PP

28721

prep to

28722

אַבְרָהָ֥ם

nmpr Abraham st=a

28723

prep to

28724

יִצְחָ֖ק

nmpr Isaac st=a

28725

וּֽ

conj and

28726

prep to

28727

יַעֲקֹֽב׃

nmpr Jacob st=a

Result 678985

Exodus 29:17

678985

phrase 678985 Cmpl PP

44870

prep upon

44871

נְתָחָ֖יו

subs piece st=a

44872

conj and

44873

prep upon

44874

רֹאשֹֽׁו׃

subs head st=a

Result 825493

Joel 4:15

825493

phrase 825493 Subj NP

295254

שֶׁ֥מֶשׁ

subs sun st=a

295255

conj and

295256

יָרֵ֖חַ

subs moon st=a

Result 672371

Exodus 9:19

672371

phrase 672371 Objc PP

33408

prep <object marker>

33409

מִקְנְךָ֔

subs purchase st=a

33410

conj and

33411

אֵ֛ת

prep <object marker>

33412

subs whole st=c

Result 752677

2_Samuel 23:1

752677

phrase 752677 PreC NP

174810

נְאֻ֧ם

subs speech st=c

174811

דָּוִ֣ד

nmpr David st=a

phrase 752677 PreC NP

174812

בֶּן־

subs son st=c

174813

יִשַׁ֗י

nmpr Jesse st=a

phrase 752677 PreC NP|CP

174814

conj and

phrase 752677 PreC NP

174815

נְאֻ֤ם

subs speech st=c

174816

art the

174817

גֶּ֨בֶר֙

subs vigorous man st=a

phrase 752677 PreC NP

174820

מְשִׁ֨יחַ֙

subs anointed st=c

174821

אֱלֹהֵ֣י

subs god(s) st=c

174822

יַֽעֲקֹ֔ב

nmpr Jacob st=a

174823

conj and

174824

נְעִ֖ים

subs pleasant st=c

174825

זְמִרֹ֥ות

subs song st=c

174826

יִשְׂרָאֵֽל׃

nmpr Israel st=a

Result 846461

Psalms 91:2

846461

phrase 846461 Voct NP

326421

מַחְסִ֣י

subs refuge st=a

326422

conj and

326423

מְצוּדָתִ֑י

subs fortification st=a

phrase 846461 Voct NP

326424

אֱ֝לֹהַ֗י

subs god(s) st=a

Result 667957

Genesis 46:12

667957

phrase 667957 PreC PrNP

26195

חֶצְרֹ֥ון

nmpr Hezron st=a

26196

conj and

26197

חָמֽוּל׃

nmpr Hamul st=a

Result 888109

1_Chronicles 3:17

888109

phrase 888109 PreC PrNP

392716

אַסִּ֔ר

nmpr Assir st=a

392717

שְׁאַלְתִּיאֵ֖ל

nmpr Shealtiel st=a

phrase 888109 PreC PrNP|NP

392718

בְּנֹֽו׃

subs son st=a

Result 744242

2_Samuel 2:9

744242

phrase 744242 Cmpl PP

161141

prep to

161142

art the

161143

גִּלְעָ֔ד

nmpr Gilead st=a

161144

conj and

161145

prep to

161146

art the

161147

אֲשׁוּרִ֖י

subs Ashurite st=a

161148

conj and

161149

אֶֽל־

prep to

161150

יִזְרְעֶ֑אל

nmpr <town> st=a

phrase 744242 Cmpl PP|CP

161151

conj and

phrase 744242 Cmpl PP

161152

prep upon

161153

אֶפְרַ֨יִם֙

nmpr Ephraim st=a

161154

conj and

161155

prep upon

161156

בִּנְיָמִ֔ן

nmpr Benjamin st=a

161157

conj and

161158

prep upon

161159

יִשְׂרָאֵ֖ל

nmpr Israel st=a

161160

כֻּלֹּֽה׃ פ

subs whole st=a

Result 839989

Psalms 40:11

839989

phrase 839989 Objc NP

317129

חַסְדְּךָ֥

subs loyalty st=a

317130

וַ֝

conj and

317131

אֲמִתְּךָ֗

subs trustworthiness st=a

Result 668154

Genesis 47:1

668154

phrase 668154 Subj NP

26565

אָבִ֨י

subs father st=a

26566

conj and

26567

אַחַ֜י

subs brother st=a

26568

conj and

26569

צֹאנָ֤ם

subs cattle st=a

26570

conj and

26571

בְקָרָם֙

subs cattle st=a

26572

conj and

26573

subs whole st=c

Result 874390

Esther 2:17

874390

phrase 874390 Objc NP

366497

חֵ֥ן

subs grace st=a

366498

conj and

366499

חֶ֛סֶד

subs loyalty st=a

Result 752311

2_Samuel 21:22

752311

phrase 752311 Cmpl PP

174284

prep in

174285

יַד־

subs hand st=c

174286

דָּוִ֖ד

nmpr David st=a

174287

conj and

174288

prep in

174289

יַ֥ד

subs hand st=c

174290

עֲבָדָֽיו׃ פ

subs servant st=a

Result 778136

Isaiah 20:3

778136

phrase 778136 Modi AdvP

218653

עָרֹ֣ום

advb naked st=a

218654

conj and

218655

יָחֵ֑ף

advb barefoot st=a

Result 661532

Genesis 30:40

661532

phrase 661532 Cmpl PP

16296

prep to

16297

עָקֹ֛ד

subs twisted st=a

16298

conj and

16299

subs whole st=c

16300

ח֖וּם

subs ruttish st=a

phrase 661532 Cmpl PP

16301

prep in

16302

צֹ֣אן

subs cattle st=c

16303

לָבָ֑ן

nmpr Laban st=a

Result 749610

2_Samuel 16:2

749610

phrase 749610 Subj NP

169795

art the

169796

לֶּ֤חֶם

subs bread st=a

169797

conj and

169798

art the

169799

קַּ֨יִץ֙

subs summer st=a

Result 750362

2_Samuel 18:1

750362

phrase 750362 Objc NP

171060

שָׂרֵ֥י

subs chief st=c

171061

אֲלָפִ֖ים

subs thousand st=a

171062

conj and

171063

שָׂרֵ֥י

subs chief st=c

171064

מֵאֹֽות׃

subs hundred st=a

Result 755237

1_Kings 4:4

755237

phrase 755237 Subj PrNP

179116

צָדֹ֥וק

nmpr Zadok st=a

179117

conj and

179118

אֶבְיָתָ֖ר

nmpr Abiathar st=a

Result 831683

Haggai 2:12

831683

phrase 831683 Cmpl PP

304712

prep to

304713

art the

304714

לֶּ֨חֶם

subs bread st=a

304715

conj and

304716

prep to

304717

art the

304718

נָּזִ֜יד

subs boiled food st=a

304719

conj and

304720

prep to

304721

art the

304722

יַּ֧יִן

subs wine st=a

304723

conj and

304724

prep to

304725

שֶׁ֛מֶן

subs oil st=a

304726

conj and

304727

prep to

304728

subs whole st=c

304729

מַאֲכָ֖ל

subs food st=a

Result 789309

Isaiah 66:23

789309

phrase 789309 Time PP

234956

מִֽ

prep from

234957

דֵּי־

subs sufficiency st=c

234958

חֹ֨דֶשׁ֙

subs month st=a

phrase 789309 Time PP

234959

prep in

234960

חָדְשֹׁ֔ו

subs month st=a

phrase 789309 Time PP|CP

234961

conj and

phrase 789309 Time PP

234962

prep from

234963

דֵּ֥י

subs sufficiency st=c

234964

שַׁבָּ֖ת

subs sabbath st=a

phrase 789309 Time PP

234965

prep in

234966

שַׁבַּתֹּ֑ו

subs sabbath st=a

Result 658414

Genesis 24:31

658414

phrase 658414 Objc NP

11725

art the

11726

בַּ֔יִת

subs house st=a

11727

conj and

11728

מָקֹ֖ום

subs place st=a

phrase 658414 Objc NP|PP

11729

לַ

prep to

11730

art the

11731

גְּמַלִּֽים׃

subs camel st=a

Result 889163

1_Chronicles 7:35

889163

phrase 889163 PreC PrNP

395492

צֹופַ֥ח

nmpr Zophah st=a

395493

conj and

395494

יִמְנָ֖ע

nmpr Imna st=a

395495

conj and

395496

שֵׁ֥לֶשׁ

nmpr Shelesh st=a

395497

conj and

395498

עָמָֽל׃

nmpr Amal st=a

Result 884376

Nehemiah 3:25

884376

phrase 884376 Adju PP

384729

prep from

384730

נֶּ֣גֶד

subs counterpart st=c

384731

art the

384732

מִּקְצֹועַ֒

subs corner post st=a

384733

conj and

384734

art the

384735

מִּגְדָּ֗ל

subs tower st=a

phrase 884376 Adju PP|NP

384742

art the

384743

עֶלְיֹ֔ון

subs upper st=a

Result 875893

Esther 9:8

875893

phrase 875893 Objc PP

369342

אֵ֧ת׀

prep <object marker>

369343

פֹּורָ֛תָא

nmpr Poratha st=a

369344

conj and

369345

אֵ֥ת׀

prep <object marker>

369346

אֲדַלְיָ֖א

nmpr Adalia st=a

369347

conj and

369348

אֵ֥ת׀

prep <object marker>

369349

אֲרִידָֽתָא׃

nmpr Aridatha st=a

Result 770582

2_Kings 16:15

770582

phrase 770582 Objc PP

205418

prep <object marker>

205419

עֹֽלַת־

subs burnt-offering st=c

205420

art the

205421

בֹּקֶר֩

subs morning st=a

205422

conj and

205423

prep <object marker>

205424

מִנְחַ֨ת

subs present st=c

205425

art the

205426

עֶ֜רֶב

subs evening st=a

205427

וְֽ

conj and

205428

prep <object marker>

205429

עֹלַ֧ת

subs burnt-offering st=c

205430

art the

205431

מֶּ֣לֶךְ

subs king st=a

205432

conj and

205433

prep <object marker>

205434

מִנְחָתֹ֗ו

subs present st=a

phrase 770582 Objc PP|CP

205435

וְ֠

conj and

phrase 770582 Objc PP

205436

אֵת

prep <object marker>

205437

עֹלַ֞ת

subs burnt-offering st=c

205438

subs whole st=c

205439

עַ֤ם

subs people st=c

205440

art the

205441

אָ֨רֶץ֙

subs earth st=a

phrase 770582 Objc PP|CP

205442

conj and

phrase 770582 Objc PP|NP

205443

מִנְחָתָ֣ם

subs present st=a

205444

conj and

205445

נִסְכֵּיהֶ֔ם

subs libation st=a

Result 664884

Genesis 39:5

664884

phrase 664884 Cmpl PP

21521

prep in

21522

art the

21523

בַּ֖יִת

subs house st=a

21524

conj and

21525

בַ

prep in

21526

art the

21527

שָּׂדֶֽה׃

subs open field st=a

Result 848629

Psalms 107:21

848629

phrase 848629 Objc NP

329644

חַסְדֹּ֑ו

subs loyalty st=a

329645

וְ֝

conj and

329646

נִפְלְאֹותָ֗יו

subs be miraculous nif ptca st=a

phrase 848629 Objc NP|PP

329647

prep to

329648

בְנֵ֥י

subs son st=c

329649

אָדָֽם׃ ׆

subs human, mankind st=a

Result 819569

Ezekiel 39:21

819569

phrase 819569 Objc PP

284777

prep <object marker>

284778

מִשְׁפָּטִי֙

subs justice st=a

phrase 819569 Objc PP|CP

284781

conj and

phrase 819569 Objc PP

284782

prep <object marker>

284783

יָדִ֖י

subs hand st=a

Result 799529

Jeremiah 32:11

799529

phrase 799529 Objc PP

251702

prep <object marker>

251703

סֵ֣פֶר

subs letter st=c

251704

art the

251705

מִּקְנָ֑ה

subs purchase st=a

251706

prep <object marker>

251707

art the

251708

חָת֛וּם

subs seal qal ptcp st=a

phrase 799529 Objc PP|NP

251709

art the

251710

מִּצְוָ֥ה

subs commandment st=a

251711

conj and

251712

art the

251713

חֻקִּ֖ים

subs portion st=a

phrase 799529 Objc PP|CP

251714

conj and

phrase 799529 Objc PP

251715

prep <object marker>

251716

art the

251717

גָּלֽוּי׃

subs uncover qal ptcp st=a

Result 723111

Judges 1:15

723111

phrase 723111 Objc PP

127757

אֵ֚ת

prep <object marker>

127758

גֻּלֹּ֣ת

subs basin st=c

127759

עִלִּ֔ית

subs upper st=a

127760

conj and

127761

אֵ֖ת

prep <object marker>

127762

גֻּלֹּ֥ת

subs basin st=c

127763

תַּחְתִּֽית׃ פ

subs lower st=a

Issues Tracking¶

This section is dedicated to tracking and dealing with issues that are caused by deficiencies in the BHSA data and which cannot easily be solved even with a patch.

In [87]:

known_issues = {} # curr empty; will add more if they emerge and I cannot immediately address

In [88]:

# for phrase, note in known_issues.items():
#     A.prettyTuple((phrase,)+tuple(nheads[phrase]), seqNumber=note)
#     show_subphrases(phrase)

Cautions¶

The following cases contain cautions as they require further investigation or may be debateable.

In [89]:

cautions = []

ezra_4_13 = '''

book book@en=Ezra
    chapter chapter=4
        verse verse=13
            phrase
                =: word lex=MDH/
'''
ezra413_note = '''
Is the word בלו in this phrase a head or a modifying element of מנדה? It is connected with a maqqeph. 
Its interpretation affects what one does with the next nominal: הלך. The head has been selected in
this way because the pattern in this phrase conforms with other cases where a modifying element connected
with maqqeph is followed by coordination.
'''
cautions.append({'template':ezra_4_13, 'phrasei':3, 'note':ezra413_note})

The cautions are all displayed below with their notes.

In [90]:

for i, caution in enumerate(cautions):
    caution_res = A.search(caution['template'], silent=True)
    phrase = caution_res[0][caution['phrasei']]
    A.prettyTuple((phrase,)+tuple(nheads[phrase]), seqNumber=caution['note'])

Result * Is the word בלו in this phrase a head or a modifying element of מנדה? It is connected with a maqqeph. Its interpretation affects what one does with the next nominal: הלך. The head has been selected in this way because the pattern in this phrase conforms with other cases where a modifying element connected with maqqeph is followed by coordination.

Ezra 4:13

881928

phrase 881928 Objc NP

379774

מִנְדָּֽה־

subs tax st=a

379775

בְלֹ֤ו

subs tribute st=a

379776