Text and data mining
Information and permission request
AM is always interested in supporting research initiatives and learning more about how our products are used.
Data mining as an activity is no different from any other usage of our products and must conform to all the standard requirements in our licence agreement, e.g. it must be carried out by Authorised Users for academic purposes. This includes usage of data in AI programs, including Large Language Models.
AM recognises the benefits that Data Mining has for new research in the Humanities and Social Sciences and we are committed to enabling these research methods on the following principles:
1. We allow Data Mining/Text Analysis by "Authorised Users" for non-commercial academic research, with some restrictions and requirements (as outlined below).
2. Secure transfer of data to a university server can be made on submission of the adjacent text and data mining request form.
3. Data can be extracted from the main collection website by automated software once the adjacent text and data mining request form has been submitted and permission to do so has been granted.
4. For the avoidance of doubt, data includes, but is not restricted to, the sources included in an AM resource and on the AM corporate website, and any derivatives thereof, e.g. transcripts, digital surrogates, metadata, editorial text.
Restrictions and requirements on data mining
1. It is prohibited to input data into a data-retaining AI model.
2. The use of web crawlers by AI Large Language Models for training purposes is restricted and cannot take place without permission.
3. Large volumes of data extracted, or full data sets provided, from the products must be stored securely so that the data is not exposed to unauthorised or open usage in breach of the User Licence agreement.
4. Performance of live product websites for standard usage must not be damaged by any automated data mining processing. We reserve the right to restrict or refuse this activity if performance is impacted.
As a result, any significant automated data extraction, or provision of large volumes of data in an offline data supply, is unauthorised unless a written request has been received and permission has been granted in writing. Provided that suitable assurances as to the purpose and security of the research are given on completion of the adjacent text and data mining request form, this permission will not be unreasonably withheld.
The licensor and copyright holder of Licensed Materials must be acknowledged in published text analysis research results derived from the Licensed Materials.
All use of original source data and the results of searching and extracting data therefrom shall be in strict accordance with the terms of the AM Licence Agreement and copyright law. Any other use is prohibited.