Centro de Documentacion de Fundación MAPFRE - Accelerating the computation of shapley effects for datasets with many observations

<?xml version="1.0" encoding="UTF-8"?><modsCollection xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-8.xsd">
<mods version="3.8">
<titleInfo>
<title>Accelerating the computation of shapley effects for datasets with many observations</title>
</titleInfo>
<name type="personal" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20140009800">
<namePart>Tzougas, George</namePart>
<nameIdentifier>MAPA20140009800</nameIdentifier>
</name>
<name type="corporate" xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20200009078">
<namePart>Springer Nature</namePart>
<nameIdentifier>MAPA20200009078</nameIdentifier>
</name>
<typeOfResource>text</typeOfResource>
<genre authority="marcgt">periodical</genre>
<originInfo>
<place>
<placeTerm type="code" authority="marccountry">che</placeTerm>
</place>
<dateIssued encoding="marc">2025</dateIssued>
<issuance>serial</issuance>
</originInfo>
<language>
<languageTerm type="code" authority="iso639-2b">eng</languageTerm>
</language>
<physicalDescription>
<form authority="marcform">print</form>
</physicalDescription>
<abstract displayLabel="Summary">The document presents a strategy to accelerate the computation of Shapley effects, a sensitivity-analysis method used to identify the importance of risk factors in actuarial models. The traditional procedure becomes computationally expensive when dealing with large datasets. The authors propose reducing the sample size using techniques such as Latin Hypercube Sampling, Conditional Latin Hypercube Sampling, and Hierarchical k-means, selecting representative observations while preserving calculation accuracy. They apply this methodology to the well-known French automobile claim-frequency dataset, demonstrating drastic reductions in computation time with minimal loss of precision. The study concludes that this approach enables efficient estimation of Shapley effects even in big-data contexts, providing a relevant advancement for actuarial modeling and insurance risk analysis</abstract>
<note type="statement of responsibility">Giovanni Rabitti and George Tzougas
</note>
<subject xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20140022717">
<topic>Big data</topic>
</subject>
<subject xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20080592011">
<topic>Modelos actuariales</topic>
</subject>
<subject xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20140007837">
<topic>Clusters</topic>
</subject>
<subject xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20080570651">
<topic>Siniestralidad</topic>
</subject>
<subject xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="MAPA20170005476">
<topic>Machine learning</topic>
</subject>
<classification authority="">6</classification>
<relatedItem type="host">
<titleInfo>
<title>European Actuarial Journal</title>
</titleInfo>
<originInfo>
<publisher>Cham, Switzerland  : Springer Nature Switzerland AG,  2021-2022</publisher>
</originInfo>
<identifier type="local">MAP20220007085</identifier>
<part>
<text>11/08/2025 Volume 15 - Number 2 - August  2025 , p. 885 - 898</text>
</part>
</relatedItem>
<recordInfo>
<recordContentSource authority="marcorg">MAP</recordContentSource>
<recordCreationDate encoding="marc">260206</recordCreationDate>
<recordChangeDate encoding="iso8601">20260211184612.0</recordChangeDate>
<recordIdentifier source="MAP">MAP20260002972</recordIdentifier>
<languageOfCataloging>
<languageTerm type="code" authority="iso639-2b">spa</languageTerm>
</languageOfCataloging>
</recordInfo>
</mods>
</modsCollection>