Large Scale Management of Physicists Personal Analysis Data Without Employing User and Group Quotas. Issue 3 (December 2015)
- Record Type:
- Journal Article
- Title:
- Large Scale Management of Physicists Personal Analysis Data Without Employing User and Group Quotas. Issue 3 (December 2015)
- Main Title:
- Large Scale Management of Physicists Personal Analysis Data Without Employing User and Group Quotas
- Authors:
- Norman, A.
Diesbug, M.
Gheith, M.
Illingworth, R.
Lyon, A.
Mengel, M. - Abstract:
- Abstract: The ability of modern HEP experiments to acquire and process unprecedented amounts of data and simulation have lead to an explosion in the volume of information that individual scientists deal with on a daily basis. Explosion has resulted in a need for individuals to generate and keep large personal analysis data sets which represent the skimmed portions of official data collections, pertaining to their specific analysis. While a significant reduction in size compared to the original data, these personal analysis and simulation sets can be many terabytes or 10s of TB in size and consist of 10s of thousands of files. When this personal data is aggregated across the many physicists in a single analysis group or experiment it can represent data volumes on par or exceeding the official production samples which require special data handling techniques to deal with effectively. In this paper we explore the changes to the Fermilab computing infrastructure and computing models which have been developed to allow experimenters to effectively manage their personal analysis data and other data that falls outside of the typically centrally managed production chains. In particular we describe the models and tools that are being used to provide the modern neutrino experiments like NOvA with storage resources that are sufficient to meet their analysis needs, without imposing specific quotas on users or groups of users. We discuss the storage mechanisms and the caching algorithmsAbstract: The ability of modern HEP experiments to acquire and process unprecedented amounts of data and simulation have lead to an explosion in the volume of information that individual scientists deal with on a daily basis. Explosion has resulted in a need for individuals to generate and keep large personal analysis data sets which represent the skimmed portions of official data collections, pertaining to their specific analysis. While a significant reduction in size compared to the original data, these personal analysis and simulation sets can be many terabytes or 10s of TB in size and consist of 10s of thousands of files. When this personal data is aggregated across the many physicists in a single analysis group or experiment it can represent data volumes on par or exceeding the official production samples which require special data handling techniques to deal with effectively. In this paper we explore the changes to the Fermilab computing infrastructure and computing models which have been developed to allow experimenters to effectively manage their personal analysis data and other data that falls outside of the typically centrally managed production chains. In particular we describe the models and tools that are being used to provide the modern neutrino experiments like NOvA with storage resources that are sufficient to meet their analysis needs, without imposing specific quotas on users or groups of users. We discuss the storage mechanisms and the caching algorithms that are being used as well as the toolkits are have been developed to allow the users to easily operate with terascale+ datasets. … (more)
- Is Part Of:
- Journal of physics. Volume 664:Issue 3(2015)
- Journal:
- Journal of physics
- Issue:
- Volume 664:Issue 3(2015)
- Issue Display:
- Volume 664, Issue 3 (2015)
- Year:
- 2015
- Volume:
- 664
- Issue:
- 3
- Issue Sort Value:
- 2015-0664-0003-0000
- Page Start:
- Page End:
- Publication Date:
- 2015-12
- Subjects:
- Physics -- Congresses
530.5 - Journal URLs:
- http://www.iop.org/EJ/journal/1742-6596 ↗
http://ioppublishing.org/ ↗ - DOI:
- 10.1088/1742-6596/664/3/032024 ↗
- Languages:
- English
- ISSNs:
- 1742-6588
- Deposit Type:
- Legaldeposit
- View Content:
- Available online (eLD content is only available in our Reading Rooms) ↗
- Physical Locations:
- British Library DSC - 5036.223000
British Library DSC - BLDSS-3PM
British Library HMNTS - ELD Digital store - Ingest File:
- 7704.xml