FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits

Sunrit Chakraborty; Saptarshi Roy; Debabrota Basu

Pré-Publication, Document De Travail Année : 2024

FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits

(1) , (1) , (2, 3, 4, 5, 6)

1
2
3
4
5
6

Sunrit Chakraborty

Fonction : Auteur

University of Michigan [Ann Arbor]

Saptarshi Roy

Fonction : Auteur

University of Michigan [Ann Arbor]

Debabrota Basu

Fonction : Auteur
PersonId : 742129
IdHAL : debabrota-basu

Scool

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Inria Lille - Nord Europe

Université de Lille

Centrale Lille

Résumé

High dimensional sparse linear bandits serve as an efficient model for sequential decision-making problems (e.g. personalized medicine), where high dimensional features (e.g. genomic data) on the users are available, but only a small subset of them are relevant. Motivated by data privacy concerns in these applications, we study the joint differentially private high dimensional sparse linear bandits, where both rewards and contexts are considered as private data. First, to quantify the cost of privacy, we derive a lower bound on the regret achievable in this setting. To further address the problem, we design a computationally efficient bandit algorithm, \textbf{F}orgetfu\textbf{L} \textbf{I}terative \textbf{P}rivate \textbf{HA}rd \textbf{T}hresholding (FLIPHAT). Along with doubling of episodes and episodic forgetting, FLIPHAT deploys a variant of Noisy Iterative Hard Thresholding (N-IHT) algorithm as a sparse linear regression oracle to ensure both privacy and regret-optimality. We show that FLIPHAT achieves optimal regret up to logarithmic factors. We analyze the regret by providing a novel refined analysis of the estimation error of N-IHT, which is of parallel interest.

Mots clés

Bandits Contextual Bandits Differential privacy Regret Bounds High dimensional sparse data Sparsity Linear bandits

Domaines

Machine Learning [stat.ML] Intelligence artificielle [cs.AI] Cryptographie et sécurité [cs.CR] Apprentissage [cs.LG] Théorie [stat.TH]

Debabrota Basu : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04615697

Soumis le : mardi 18 juin 2024-12:40:35

Dernière modification le : mercredi 19 juin 2024-03:22:00

Dates et versions

hal-04615697 , version 1 (18-06-2024)

Licence

Paternité

Identifiants

HAL Id : hal-04615697 , version 1
ARXIV : 2405.14038

Citer

Sunrit Chakraborty, Saptarshi Roy, Debabrota Basu. FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits. 2024. ⟨hal-04615697⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CRISTAL INRIA2 UNIV-LILLE CRISTAL-SCOOL ANR PEPR_IA

20 Consultations

0 Téléchargements

FLIPHAT: Joint Differential Privacy for High Dimensional Sparse Linear Bandits

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager