23 August 2022

Long false positives

In the previous entries we learned that the getrefs algorithm usually finds real quotations, especially if the match is long enough. In Psalms, the longest false positive is 34. In Isaiah, the same number is 37. In this entry we focus on the other books in the Old Testament.

I performed the same steps by using the getrefs command in the same way like for the Psalms and Isaiah. Here are the results: In Genesis, the longest false positive is 35 (16:11), in Exodus, they are 52 and 51 (20:11, refrained in Acts 4:24 and 14:15), and I continued finally with writing a script that performs this check in all books automatically. The result (after removing duplicates) can be found in a CSV file (warning, it contains 127531 rows after the header). The first 64 entries contain real matches with lengths from 318 to 53, with two exceptions. (Also, I skipped the two disputable matches from Exodus 20:11 (lengths 52 and 51) and continued until lengths 35. Actually, most matches are real ones, and not just false positives. But, from the view of a possible strict decision process, one has to stop at 52.)

It is important that among the first 64 entries there are matches of lengths 75 and 57 (of Psalms 95:7, in Hebrews 3:15 and 4:7) which are repetitions (the first occurrence is in Hebrews 3:7, on length 134). There is a similar repetition of length 52 (of Psalms 95:11, in Hebrews 4:3) which repeats the first occurrence from Hebrews 3:10.

An outcome of this check was to start a deeper study of Acts 7. Stephen, its main character, before being stoned to death, summarizes important milestones of Jewish history. Several passages are cited by him, in some cases, literally, but sometimes just a summary can be read. I have the feeling that his literal citations and the existence of a direct introduction have a positive correlation. I plan to work on this in the following days.

In fact, the most important result of the check is that long matches always point to real quotations. This statement is valid until length 52 if the repetitions from Hebrews are ignored. Natural logic suggests that this is the case for all matching texts that are long enough. My check confirmed this natural assumption and gave a numerical result, in addition, for the border case.


Entries on topic internal references in the Bible

  1. Web version of bibref (12 January 2022)
  2. Order in chaos (17 January 2022)
  3. Reproducibility and imperfection (20 January 2022)
  4. A student of Gamaliel's (23 January 2022)
  5. Non-literal matches in the Romans (26 January 2022)
  6. Literal matches: minimal uniquity and maximal extension (31 January 2022)
  7. Literal matches: the minunique and getrefs algorithms (1 February 2022)
  8. Non-literal matches: Jaccard distance (2 February 2022)
  9. Non-literal matches in the Romans: Part 2 (3 February 2022)
  10. A summary on the Romans (5 February 2022)
  11. The Psalms (6 February 2022)
  12. The Psalms: Part 2 (7 February 2022)
  13. A classification of structure diagrams (15 February 2022)
  14. Isaiah: Part 1 (19 February 2022)
  15. Isaiah: Part 2 (26 February 2022)
  16. Isaiah: Part 3 (2 March 2022)
  17. Isaiah: Part 4 (7 March 2022)
  18. Isaiah: Part 5 (15 March 2022)
  19. Isaiah: Part 6 (23 March 2022)
  20. Isaiah: Part 7 (30 March 2022)
  21. A summary (7 April 2022)
  22. On the Wuppertal Project, concerning Matthew (17 July 2022)
  23. Matthew, a summary (25 July 2022)
  24. Isaiah, a second summary (31 July 2022)
  25. Long false positives (23 August 2022)
  26. A general visualization (25 August 2022)
  27. Stephen's defense speech (19 September 2022)
  28. Statistical Restoration Greek New Testament (31 July 2023)
  29. Qt version of bibref (11 March 2024)

Zoltán Kovács
Linz School of Education
Johannes Kepler University
Altenberger Strasse 69
A-4040 Linz