Abstract
The HRAF databases, eHRAF World Cultures and eHRAF Archaeology, each containing large corpora of curated text subject-indexed at the paragraph-level by anthropologists, were designed to facilitate rapid retrieval of information. The texts describe social and cultural life in past and present societies around the world. As of the spring of 2018, eHRAF contains almost three million indexed “paragraph” units from over 8000 documents describing over 400 societies and archaeological traditions. This chapter first discusses concrete problems of scale resulting from large numbers of complex elements retrieved by any given search. Second, we discuss potential and partial solutions that resolve these problems to advance research, whether based on specific hypotheses, classification or identifying and evaluating embedded patterns of relationships. Third, we discuss new kinds of research possibilities that can be further advanced, have not yet been successfully attempted, or have not even been considered using anthropological data because of scale and complexity of achieving a result.