DOI QR코드

DOI QR Code

Fast Range Query on Encrypted Multi-dimensional Data in Cloud Environment

  • Zhuolin Mei (School of Computer and Big Data Science, Jiujiang University) ;
  • Jing Zeng (China Gridcom Co., Ltd.) ;
  • Caicai Zhang (School of Modern Information Technology, Zhejiang Polytechnic University of Mechanical and Electrical Engineering) ;
  • Shimao Yao (School of Computer and Big Data Science, Jiujiang University) ;
  • Jiaoli Shi (School of Computer and Big Data Science, Jiujiang University) ;
  • Bin Wu (School of Computer and Big Data Science, Jiujiang University)
  • Received : 2024.03.10
  • Accepted : 2024.07.31
  • Published : 2024.09.30

Abstract

Cloud computing has extensively grown in recent years. A large amount of data is stored in cloud servers. To ensure confidentiality, these data is often encrypted and then stored in cloud servers. However, encryption makes range queries difficult to perform. To solve this issue, we present a scheme that facilitates fast range queries on encrypted multi-dimensional data in scenarios involving multiple users. In our scheme, we construct a tree index on encrypted multi-dimensional data, and each node is linked to a secure enhanced multi-dimensional range (MDR). To support efficient range query on the tree index, we adopt bloom filter technique. Additionally, users' privileges are designed in a one-way calculation manner to support that different users can only perform range queries within their own privileges. Finally, we conduct extensive experiments which show the efficiency of our scheme, and also conduct a thorough analysis of its security.

Keywords

1. Introduction

Cloud computing is a very attractive technology[1]. By leveraging cloud servers for outsourcing data and services, data owner and users can enjoy various advantages[2]. However, as the outsourced data stored on remote cloud servers is not directly under the control of data owners, data privacy emerges as a paramount concern[3, 4]. An effective approach to address this concern is to encrypt the data before outsourcing it. However, as ciphertexts are indistinguishable with each other, basic data manipulations become very difficult, such as queries[5].

Numerous novel encryption schemes have been proposed to facilitate range queries in recent years, but there still exist limitations. Order preserving encryption methods support efficient comparison between the ciphertexts of numerical data[6]. However, according to the ordering of ciphertexts, plaintexts can be precisely estimated. To preserve the ordering of ciphertexts, researchers have proposed bucketization methods[7, 8, 9, 10]. These methods involve dividing the data into multiple buckets, encrypting them in each bucket collectively. This ensures that the ordering of ciphertexts within a bucket remains secure. However, a limitation of these methods is that they do not support scenarios involving multiple users. To solve this issue, public-key based methods are proposed[11, 12, 13]. However, these public key-based methods inherently involve a considerable amount of intricate computations. Thus, the main shortage of these public key-based methods is inefficient. As the technique of bloom filter[14] can be utilized to efficiently judge whether a datum belongs to a set or not, many methods[15, 16, 17, 18] adopt bloom filter to achieve secure range query on ciphertexts. Many works[19, 20] focus on proposing frameworks and heightened security for range query over ciphertexts. Existing searchable keyword encryption schemes can be applied in these frameworks and then provide secure and fast range queries on ciphertexts. However, the above bloom filter based methods and the proposed frameworks for range queries on ciphertexts fail to account for the handling of multi-dimensional data.

In this paper, we propose a scheme which enables secure and fast range queries on encrypted multi-dimensional data in scenarios involving multiple users. In our scheme, we construct a tree index, and each node is linked to a secure enhanced multi-dimensional range (MDR). All the data covered by the MDR of each leaf node is encrypted by using a secure encryption scheme (e.g. AES) and stored under the leaf node. Then, the data owner enhances the security of the MDR of each node in the tree index by using one-way hash functions (e.g. SHA families). Next, the data owner generates a binary string for each security enhanced MDR by using bloom filter, and stores the binary string in the corresponding node. After all the MDRs associated with the nodes have been transformed to binary strings, the data owner deletes all the information in these nodes (except the binary strings and the links between nodes) to form a secure and efficient interval tree index (denoted by SEITI). Additionally, the data owner sets different privilege values for different nodes of SEITI and constructs a one-way calculation between these privilege values. Finally, the data owner outsources encrypted multi-dimensional data and SEITI to the cloud server, and distributes different privilege values to users. A user can use his/her privilege value to efficiently and securely perform range query on SEITI only within his/her privilege.

We summarize our scheme into the following three aspects: (1) We propose a tree index SEITI based on MDR encoding, one-way hash functions and bloom filter. (2) We propose an efficient and secure range query method over encrypted multi-dimensional data by using SEITI. (3) We conduct many experiments which demonstrate the efficiency of our scheme, and we also analyze its security.

2. Related Work

There are primarily four categories of secure and fast range query methods over encrypted data: order preserving encryption methods, bucketization methods, bloom filter based methods and searchable symmetric encryption methods.

In OPE, data is encrypted in an ordered manner, allowing for fast ciphertext retrieval[6]. Popa et al.[21] build a mutable OPE that achieves the ideal security of OPE. In this method, a binary tree is a state of the encryption which is maintained on a server. Intensive client-to-server interactions and mutations on previous ciphertexts are required when encrypting new numeric data. In [22], authors have improved mutable OPE by using random binary trees.

In bucketization methods, data is divided into different buckets. Each bucket is assigned an identity. Thus, the ordering of ciphertexts in a bucket can be well protected. When the queried range overlaps with a bucket, the ciphertexts in the bucket will be returned as part of the query results. Wang et al.[8] construct an index \(\begin{align}\hat{R}\end{align}\) − tree . Each node of \(\begin{align}\hat{R}\end{align}\) − tree is linked to an encrypted minimum bounding rectangle (MBR). The adopted encryption scheme is asymmetric scalar-product preserving encryption (ASPE)[23]. Lee[9] proposes an ordered bucketization method. According to the ordering of buckets, query results can be efficiently found. However, Lee’s method only considers the ciphertexts of single dimensional data. Given a range, ASPE can be used to judge whether the range intersects with an MBR. Thus, Wang’s method allows a user to find the query resultsin a top-down manner by using \(\begin{align}\hat{R}\end{align}\) − tree. Mei et al.[24] construct key derivation in their tree index to support multiple users with different search privileges.

Li et al.[15] propose a scheme called PBtree to perform secure range query. According to prefix encoding and bloom filter, the index in this scheme achieves index indistinguishability. In each node of the index, all the data are encoded and then form a binary string by using one-way hash function and bloom filter. When there is a large amount of data, the binary strings are very long, which incur a large space overhead of PBtree. In [16], Wang et al. propose a method which supports range query for arbitrary geometry over ciphertexts. Each data point and geometric range query are represented as binary strings of bloom filters. Whether a point lies within a geometric area is determined by the inner product of two binary strings. However, as the data or query scope increases, it becomes necessary to create large bloom filters that covers all potential points within a specific query range. This, in turn, results in increased storage and computational costs. Then, Wang et al.[26] propose an improved method based on [16]. However, as the query scope increases, it also requires exhausting all the data that may be searched during the entire query process. Cui et al.[17] propose a privacy preserving solution to support boolean range query on ciphertexts. They present a spatio-textual data encoding technique based on bloom filter and assess the effectiveness of bloom filters using ASPE[23]. Mahdikhani et al.[18] employs Paillier homomorphic cryptosystem and ingenious bloom filter data structure for simultaneously achieving better privacy and higher efficiency in the count aggregation in a privacy-preserving range query scenario.

Faber et al.[27] employ a binary tree structure to handle dataset. They leverage a static SSE to support range queries over ciphertexts. Demertzis et al.[28] also utilize a similar index as used in [27], and introduce several range query schemes. Each scheme offers a distinct trade-off between security and efficiency. Zuo et al.[19] introduce two schemes incorporating range-covering technique to facilitate range queries on ciphertexts. Additionally, their schemes offer flexible update functionality. Wang et al.[20] propose a scheme which supports range queries over dynamic database with forward/backward security. However, these schemes only consider the range query on the ciphertexts of single dimensional data.

3. System and Security Model

Fig. 1 shows the architecture of our scheme. Data owner (DO) outsources encrypted multi-dimensional data and a secure and efficient interval tree index (SEITI) to the cloud server (CS). Then, DO distributes privilege values, privilege paths and the information about MDRs to users (Us). If a user (U) maintains the privilege value and the privilege path of an MDR (denoted by MDR ), he/she can generate the search token for any sub-MDR (denoted by MDR', MDR' ⊆ MDR ). Then, U sends the search token of MDR' to CS. After receiving the search token, CS performs range query on SEITI and finds all the encrypted multi-dimensional data in MDR'. Finally, CS returns the search results to U.

E1KOBZ_2024_v18n9_2717_4_f0001.png 이미지

Fig. 1. Architecture of Our Scheme.

In our scheme, we adopt the honest but curious model for CS. The term "honest" implies that CS (i) faithfully adheres to the designated protocols and procedures to fulfill its role as a service provider, and (ii) refrains from modifying the ciphertexts or secure index stored on it. On the other hand, the term "curious" indicates that CS may exhibit curiosity, either due to its own inherent curiosity or as a result of being compromised to act on behalf of a third party. In our scheme, the threat model relies on the information available to adversaries (such as CS), which is also used in [29, 30].

Definition 1 (Security [31]). Given a leakage function F , all adversaries cannot reveal more information except the leakage function F , then the range query scheme is secure. The leakage function F is defined as F(mi, mj) = positiondiff (mi, mj) where positiondiff (mi, mj) gives the different first position of mi and mj .

4. Construction of Secure and Efficient Interval Tree Index (SEITI)

In our scheme, DO first builds an interval tree over the outsourced data. Second, DO assigns multi-dimensional range (MDR) to each node of the interval tree. In each leaf node, DO uses a secure encryption algorithm (e.g. AES) to encrypt all the outsourced data covered by the MDR of the leaf node, and then stores these ciphertexts under the leaf node. Next, DO encodes each MDR into an MDR code, and then enhances the security of each MDR code to generate a secure and efficient interval tree index (SEITI). Finally, DO outsources all the ciphertexts and SEITI to CS.

In the following paragraphs, we present the construction of SEITI. The construction of SEITI consists of three parts: Multi-Dimensional Range (MDR) assignment, MDR code generation and security enhancement of MDR code.

MDR Assignment. In our scheme, SEITI is a full n-ary tree, where n is an integer and calculated by division parameters (please refer to the next paragraph). The structure of SEITI is helpful for Us to calculate appropriate privilege values and then generate the corresponding search tokens for their queried ranges (please refer to Section 5 and 6). Each node of SEITI is associated with an MDR. In SEITI, the MDR of the root node covers all the outsourced data. For each internal node, its MDR is uniformly divided into n smaller MDRs by division parameters. These smaller MDRs are assigned to the child nodes of the internal node respectively. For each leaf node, all the outsourced data covered by its MDR are encrypted as a unit by using a secure encryption scheme, and stored under the leaf node.

For the ith dimension, DO defines the division parameter divi ( divi is a positive integer). According to divi , a range Ri = (ai, bi] on the ith dimension can be uniformly divided into divi smaller ranges, (ai, ai + ∆i] , (ai + ∆i, ai + 2∆i], … , (ai + (divi - 1)∆i, ai + divii], where ∆i = (bi - ai)/divi. Similarly, an MDR MDR = (a0, b0] × (a1, b1] ×…× ( ad−1, bd−1] can be uniformly divided into \(\begin{align}n=\prod_{i=0}^{d-1} d i v_{i}\end{align}\) smaller MDRs as follows.

{(a0 + o0Δ0, b0 + (o0 + 1)Δ0] × (a1 + o1Δ1, b1 + (o1 + 1)Δ1] ×…× (ad-1 + od-1Δd-1, bd-1 + (od-1 + 1)Δd-1]|i = 0,1,…, d - 1, oi ∊ {0,1,…,divi - 1},Δi = (bi - ai)/divi}.

Suppose A is the root node of SEITI and associated with MDRA . According to the division parameters, MDRA is uniformly divided into \(\begin{align}n=\prod_{i=0}^{d-1} d i v_{i}\end{align}\) smaller MDRs and each smaller MDR is assigned to a child node of A . By using the method iteratively, each node of SEITI can be assigned to an MDR. Thus, if the MDR of a node in SEITI is known, all the MDRs of its descendant nodes can be obtained efficiently. Example 1 illustrates the MDR assignment for the nodes in SEITI.

Example 1. As shown in Fig. 2, the division parameters are div0 = 3 and div1 = 2 . SEITI is a full 6-ary tree ( 6 = div0 × div1 ). The height of SEITI is 4. The root node A is associated with (0, 243] × (0,128] . (0, 243] × (0,128] is uniformly divided into smaller MDRs iteratively, and each smaller MDR is assigned to the descendant nodes of A .

E1KOBZ_2024_v18n9_2717_5_f0001.png 이미지

Fig. 2. MDR Assignment for the Nodes in SEITI.

MDR Code Generation. MDR code generation involves two steps, range encoding and MDR encoding.

Range Encoding. Suppose (xi, yi] is a range which can be obtained by using divi to divide (ai, bi] iteratively and the length of range code is l . To encode the range (xi, yi] , one can employ the subsequent steps.

(1) DO builds a full divi -ary tree (denoted by Ti). The height of Ti is l +1.

(2) DO assigns (ai, bi] to the root node (denoted by Ai) of Ti . Then, DO sets the range code of (ai, bi] to **…* , where each * can be any value in 0,1,…, divi −1(e.g. **…* can be 00…0 , 00…1, … , 00… (divi − 1) , … ,(divi − 1)(divi −1)…(divi − 1), etc.).

(3) DO uniformly divides (ai, bi] into divi smaller ranges, which are (ai, ai + ∆i] , (ai + ∆i, ai + 2∆i] , … , (ai + (divi − 1)∆i ,ai + divii ] , where ∆i = (bi− ai)/divi . Then, DO assigns these smaller ranges to the child nodes of Ai , and sets their range codes to 0*…* , 1*…* , … , (divi − 1)*…* , respectively.

(4) By executing the above procedure iteratively, each node of Ti is associated with a smaller range and the range can be encoded to a range code. As the smaller range (xi, yi] can be obtained by using divi and (ai, bi] , some node in Ti is associated with (xi, yi] . Thus, the range code of (xi, yi] can be easily obtained.

As shown in Fig. 3, T0 is a tree over the range (0, 243] . The division parameter is div0 = 3. The length of range codes is set to l = 3 . The height of T0 is set to 4 ( 4 = l + 1 ). The root node A0 is associated with (0,243] and its range code is *** , where each * can be any value in 0, 1, 2 (2 = div0 −1). (0,243] is uniformly divided into 3 (div0 = 3) smaller ranges, (0,81], (81,162] and (162,243]. These smaller ranges are associated with the nodes B0 , C0 and D0 , and their range codes are 0** , 1** and 2** respectively. By executing the above procedure iteratively, the smaller range of each node in T0 can be encoded, e.g. the smaller ranges (0,27], (27,54] and (54,81] are encoded to 00*, 01* and 02* respectively.

E1KOBZ_2024_v18n9_2717_6_f0001.png 이미지

Fig. 3. Range Encoding in (0,243].

It is easy to find the relationship between a range and its range code. For simplicity, let c0c1…cl−1 be the range code of (xi, yi] , where cj = *,0,1,…, divi−1(j = 0,1,…,l−1).

xi = ai + Si × (c0 × divl-1i + c1 × divl-2i +…+ cl-1 × div0i)       (1)

yi = ai + Si × (c0 × divl-1i + c1 × divl-2i +…+ cl-1 × div0i) + Si       (2)

, where (ai, bi] is the range associated with the root node of Ti and Si = (bi − ai)/divli.

According to the formula (1) and (2) above, one can calculate (xi, yi] by using c0c1…cl−1. For c0c1…cl−1 , if cj ≠ * (j = 0,1,…,l−1), one takes cj into the formula (1) and (2) to calculate xi and yi . Finally, one can obtain (xi, yi]. If cj = *, (i) one takes cj = 0 into the formula (1) to calculate xi , which is the minimal value represented by c0c1…cl−1 . (ii) One takes cj = divi − 1 into the formula (2) to calculate yi , which is the maximal value represented by c0c1…cl−1 . Finally, one can obtain the range (xi, yi].

One can also calculate c0c1…cl−1 by using (xi, yi]. First, one calculates \(\begin{align}\frac{x_{i}-a_{i}}{S_{i}}=c_{0} \times d i v_{i}^{l-1}+c_{1} \times d i v_{i}^{l-2}+\ldots+c_{l-1} \times d i v_{i}^{0}\end{align}\) by using the formula (1). Then, one divides \(\begin{align}\frac{x_{i}-a_{i}}{S_{i}}\end{align}\) by the power of divi to calculate the reminder iteratively. Finally, one can obtain the code of xi . By using the same method, one can use the formula (2) to obtain the code yi . According to the codes of xi and yi , one can obtain the range code c0c1…cl−1.

According to the analysis above, it is easy to know that (i) there is no need to build a full divi -ary tree Ti , and (ii) by using the formular (1) and (2), the range code c0c1…cl−1 can be efficiently calculated when the range (xi, yi], divi and (ai, bi] are known. (iii) By using the formular (1) and (2), the range (xi, yi] can be efficiently calculated when the range code c0c1…cl−1 , divi and (ai, bi] are known.

MDR Encoding. To encode the MDR MDR = (a0, b0] × (a1, b1] ×…× (ad-1, bd-1], one can employ the subsequent steps.

(1) DO calculates the range codes for R0 = (a0, b0], R1 = (a1, b1], …, Rd = (ad, bd]. The range codes are CR0 , CR1 , …, CRd−1 respectively. DO sets a privilege value pv for MDR (please refer to the next section).

(2) Suppose (i) the range code CRi is ci0ci1…cil−1 (the length of CRi is l), where cij is the jth value of CRi , and (ii) the MDR code CMDR is v0v1…vl−1, where vj is the jth value of CMDR . Then, DO calculates kj = H(j, pv) mod d , where H is a one-way hash function (e.g. SHA families). According to kj , DO chooses CRkj and sets vj to the jth value ckjj of CRkj , i.e. vj = ckjj . Finally, after all the values v0 , v1 , …, vl−1 are determined, the MDR code CMDR = v0v1…vl−1 is obtained.

In our scheme, the MDR code CMDR can be seen as randomly calculated, because (i) the privilege value pv is calculated by using a random number and a one-way hash function (please refer to the next section), (ii) kj is calculated by using H (j, pv) mod d (thus kj can be seen as calculated by using the pseudo-random number pv ), and (iii) vj is set to the jth value ckjj of CRkj , where j = 0,1,...,l−1. Additionally, as the one-way hash function H is used in the procedure of MDR encoding, the privilege value pv is very safe even if the MDR code CMDR is leaked.

Security Enhancement for MDR Code. For security concern, the MDR code CMDR should be protected. This is because the values in CMDR are chosen from the range codes CR0 , CR1 , …, CRd−1 and the number of values in CMDR is not many. Attackers can guess the MDR code CMDR . Thus, DO should enhance the security of CMDR .

(1) DO enhances the security of each value vj ( j= 0,1,…,l−1) in CMDR . As (i) vj is chosen from the range code CRkj and (ii) the values in CRkj are chosen from *,0,1,…,divkj − 1 (please refer to Section 4), vj is one of the values *,0,1,…,divkj − 1 . Thus, (i) if vj ≠ * , DO calculates H(j, H(vj, pv)) . (ii) If vj = * , DO calculates H(j, H(0, pv)) , H H(j, H(1, pv)), … , H(j, H(divkj - 1, pv)) , because * can be any value from 0,1,…,divkj − 1 (please refer to Section 4).

Note that, as H is a one-way hash function, all the information about CMDR (including the position j , the value vj and the privilege value pv ) can be safely protected even if attackers know the value H(j, H(vj, pv))(j= 0,1,…,l−1).

(2) To support range query, DO uses bloom filter (denoted by BF ) to handle H(j, H(vj, pv)). If vj ≠ * , DO calculates vj' = BF(H(j, H(cj, pv))). If cj = * , as * can be any value from 0,1,…,divkj − 1, DO calculates vj' = BF(H(j, H(0, pv)))|BF(H(j, H(1, pv)))|…|BF(H(j, H(divkj - 1, pv))) , where | denotes bitwise OR operation.

(3) Finally, DO calculates the security enhanced MDR code of CMDR , which is [CMDR] = v0'|v1'| … |vl−1' .

In summary, first, DO builds a full n -ary tree and each node is associated with an MDR in the form of plaintext. Second, DO encodes each MDR to an MDR code and then enhances the security of the MDR code. Then, DO replaces the MDR in each node with its corresponding security enhanced form, which is as the SEITI in our scheme. Finally, DO outsources SEITI along with the encrypted multi-dimensional data to CS. Note that, different privilege values are used to enhance the security of different MDR codes (please refer to Section 5).

5. Privilege Assignment and Revocation

In this section, we first introduce privilege value. Privilege values is used to enhance the security of an MDR and describes the privilege of U. If U maintains the privilege value of a node in SEITI, he/she can perform range queries within the smaller MDRs covered by the MDR of the node. Then, we introduce privilege path, which can help CS to efficiently find the MDR which U has the privilege to search. Finally, we present the privilege assignment and revocation.

Privilege Value. For the sake of clarification, suppose A is a node in SEITI, the MDR of A is MDRA , the privilege value of A is pvA , B is a child node of A , the MDR of B is MDRB and the privilege value of B is pvB .

(1) DO chooses a hash function HID (which is public) to calculated the identities of nodes in SEITI, e.g., the identity of A can be calculated IDA = HID (MDRA) . As the MDRs of nodes in SEITI are different, their identities are different. If one knows the MDR of a node, he/she can efficiently calculate the smaller MDRs of its descendant nodes (please refer to Section 4). Thus, if one knows MDRA , he/she can calculate MDRB and then calculate the identity of B , which is IDB = HID (MDRB) .

(2) DO randomly chooses an integer pv as the privilege value of the root node in SEITI. For an internal node, its privilege value is calculated by its parent node. Specifically, if one knows MDRA and pvA , he/she calculates MDRB and IDB= HID (MDRB), and then calculates pvB = H(IDB, pvA) , where H is a one-way hash function. By using the same method, if one knows MDRA and pvA , he/she can calculate the MDR and the privilege value of any descendent node of A . Thus, given the privilege value pv and the MDR of the root node in SEITI, DO can set the privilege value for any node in SEITI.

If DO distributes MDRA and pvA to U, as U can calculate MDRC ( MDR C ⊆ MDR A ) and pvC of C (C is a descendant node of A ), U can search the ciphertexts covered by MDRC (please refer to Section 7). As H is a one-way hash function, U cannot calculate the MDRs and the privilege values of the nodes which are not the descendant nodes of A , which guarantees that U can only perform range queries within his/her privilege.

Privilege Path. The privilege path of A can help CS securely and efficiently find A in SEITI.

In SEITI, there exists a path from the root node to A . The path is denoted by ⟨N0,N1,…,Nm−1⟩ , where N0 denotes the root node, Nm−1 denotes the parent node of A and Ni is the parent node of Ni+1 ( 0 ≤ i < i+1 ≤ m−1). For the nodes on the path, their security enhanced MDR codes can be obtained from SEITI, which is ⟨[CMDRN0], [CMDRN1],…, [CMDRNm-1]⟩. DO randomly generates m − 1 binary strings bsN0, bsN1,…, bsNm-1 , s.t. bsN0 & [CMDRN0] = bsN0 , bsN1 & [CMDRN1] = bsN1, … , bsNm-1 & [CMDRNm-1] = bsNm-1 , where & denotes bitwise AND operation. Finally, DO generates PPA = ⟨bsN0, bsN1,…, bsNm-1⟩ , which is the privilege path of A .

Privilege Assignment. DO distributes ⟨(PPA, IDu)sigDO, PPA, MDRA, pvA⟩ to a user u (it means u has the privilege to search the ciphertexts in MDRA), where (PPA, IDu)sigDO is the signature of DO. The signature is used to verify the legality of u’s privilege when he/she submits query requests to CS.

Privilege Revocation. To revoke the privilege of u , DO should update SEITI in CS and assign new secret parameters to the other Us. First, DO chooses a random number for the root node of SEITI, and then calculates the pseudo-random number for each internal node. Specifically, if rA is the random number of the node A , the random number rB of the node B ( A is the parent of B ) can be calculated, rB = H(IDB, rA), where H is a one-way hash function and IDB is the identity of B . Thus, all the random numbers of the nodes in SEITI can be calculated iteratively. For each node in SEITI, if the random number is r and the security enhanced MDR code is [CMDR], DO calculates a binary string sr = BF(r) , and then calculates sr & [CMDR] . Next, DO replaces [CMDR] with sr & [CMDR] to update SEITI. Finally, if the privilege of u (suppose u holds ⟨(PPA, IDu)sigDO, PPA, MDRA, pvA⟩) is not revoked, DO generates a new privilege path PPA* , and sends ⟨(PPA*, IDu)sigDO, PPA*, MDRA, pvA, rA⟩ to u . Hence, Us whose privileges have been revoked are unable to continue performing queries on SEITI, while Us whose privileges have not been revoked can still carry out queries on SEITI.

6. Search Token Generation

After receiving ⟨(PPA, IDu)sigDO, PPA, MDRA, pvA⟩ , u can generate the search token for the queried multi-dimensional range MDRQ by using the subsequent steps.

(1) u calculates MDRA ∩ MDRQ . If MDRA ∩ MDRQ = ∅ , u has no privilege to search any ciphertexts in MDRQ . Otherwise, u generates the search token for MDRA ∩ MDRQ .

(2) u builds an n -ary tree TSRCH , where n = Πd-1i=0divi . The height of TSRCH is l + 1 − m . There is sub-tree TA in SEITI. The root node of TA is associated with MDRA . The height of TA is l + 1 − m. It is easy to know that TSRCH and TA have the same structure.

(3) u assigns MDRA to the root node of TSRCH and sets pvA as its privilege value. Then, for the other nodes of TSRCH , u iteratively calculates the smaller MDRs and privilege values by using division parameters and one-way hash functions (u can utilize similar method as DO used in Section 4 and Section 5).

(4) According to the MDRs associated with nodes in TSRCH , u calculates the minimal cover of MDRA ∩ MDRQ , which is denoted by MCMDRA∩MDRQ (please refer to the following Definition 2). Then, u prunes TSRCH . For each node N in TSRCH , if N ∉ MCMDRA∩MDRQ or N is not the ancestor node of any node in MCMDRA∩MDRQ, u deletes N and its corresponding links in TSRCH . The pruned TSRCH is denoted by TSRCH' . It is obvious that the set of all the leaf nodes in ' TSRCH equals to MCMDRA∩MDRQ .

Definition 2 (Minimal Cover). Minimal Cover is a set of some nodes in a tree. Suppose A is the root node of the tree, MDRA is the multi-dimensional range associated with A , MDR is a multi-dimensional range ( MDR ⊆ MDRA ). MCMDR is the minimal cover of MDR iff (i) \(\begin{align}M D R \subseteq \bigcup_{N \in M C_{M D R}} M D R_{N}\end{align}\), where N is a node of the tree and MDRN is associated with N ; (ii) ∀MC ∈ {MC | MC ⊆ MCMDR} , \(\begin{align}\bigcup_{N \in M C} M D R_{N} \subseteq M D R\end{align}\).

In the following paragraphs, we present the steps for u to handle TSRCH' . For ease of explanation, we suppose N is a node in TSRCH' , its MDR is MDRN = (a0, b0] × (a1, b1] ×…× (ad-1, bd-1], its privilege value is pvN and its identity is IDN = HID ( MDRN) .

(5) For each node N in TSRCH' , u randomly generates a d -dimensional dataN = (r0, r1,…, rd-1) in MDRN . According to the formula \(\begin{align}\frac{x_{i}-a_{i}}{S_{i}}=c_{0} \times d i v_{i}^{l-1}+c_{1} \times d i v_{i}^{l-2}+\ldots+c_{l-1} \times d i v_{i}^{0}\end{align}\) (please refer to Section 4), u sets xi = ri, and divides \(\begin{align}\frac{r_{i}-a_{i}}{S_{i}}\end{align}\) by the power of divi . Finally, u iteratively calculates the reminder to obtain the code of ri , which is denoted by ci0ci1…cil-1 (i = 0,1,…,d-1).

(6) u chooses some values from these codes ci0ci1…cil-1 (i = 0,1,…,d-1) to encode dataN , denoted by CdataN = t0t1…tl−1 . First, u calculates ki = H(i, pvN) mod d . Then, u chooses the code rki = cki0cki1…ckil-1. Next, u extracts the i th value in rki = cki0cki1…ckil-1 and sets ti to ckii . Finally, u calculates all the values in CdataN = t0t1…tl−1, where ti = ckii and i = 0,1,…,d-1.

Note that, the encoding procedures of a data and an MDR are very similar, but they have two differences: (i) the MDR is encoded by DO to construct SEITI, but the data is encoded by Us to generate search token; (ii) There may exist the symbol * in the code of the MDR, but there is no * in the code of the data, i.e. ti ≠ *(i = 0,1,…,d-1).

(7) u enhances the security of CdataN = t0t1…tl−1. For the value ti (ti ≠ *), u calculates ti' = BF(H(i, H(ti, pvN))), where H is the one-way hash function and BF is the bloom filter. Finally, u obtains the security enhanced CdataN , which is [CdataN]= t0'|t1'|…|tl-1'(|denotes the bitwise OR operation).

(8) For each node N in TSRCH' , u deletes all the information (including IDN , MDRN , pvN and CdataN etc.), retaining only [CdataN]. Then, u obtains the search token of MDRA ∩ MDRQ , denoted by ⟨(PPA, IDu)sigDO, (TSRCH', IDu)sigu, PPA, TSRCH'⟩ , where (TSRCH'*, IDu)sigu is the signature of u . Upon receiving ⟨(PPA, IDu)sigDO, (TSRCH', IDu)sigu, PPA, TSRCH'⟩, CS can verify the legality of the privilege of u through (PPA, IDu)sigDO , and verify the legality of the query token through (TSRCH', IDu)sigu .

Note that, if the privileges of some Us have been revoked, DO should update the privileges of the rest Us. Suppose that after the update, u holds ⟨(PPA, IDu)sigDO, PPA*, MDRA, pvA, tA⟩. To generate a search token, u firstly generates TSRCH' . Second, for each node N of TSRCH' , u calculates rN by using rA (please refer to Section 5) and calculates the binary string srN = BF(rN) . Then, u extracts [CdataN] in N and calculates srN & [CdataN]. Next, for each node N , u replaces srN & [CdataN] with N [CdataN] (the updated TSRCH' is denoted as TSRCH'* ). Finally, u obtains the search token of MDRA ∩ MDRQ , which is ⟨(PPA*, IDu)sigDO, (TSRCH'*, IDu)sigu, PPA*, TSRCH'*⟩.

7. Range query and Data Update

Privilege Assignment. To search the ciphertexts in MDRA ∩ MDRQ , u sends ⟨(PPA, IDu)sigDO, (TSRCH', IDu)sigu, PPA, TSRCH'⟩ to CS. According to PPA , CS can find the node A in SEITI. Then, by using TSRCH' , CS can find all the ciphertexts in MDRA ∩ MDRQ . The details of range query are as follows.

(1) As PPA = ⟨bsN0, bsN1,…, bsNm-1⟩, CS extracts bsN0 from PPA . Then CS extracts [CMDRM0] from the root node of SEITI, where [CMDRM0] is the secure enhanced MDR code of M0 (suppose the root node of SEITI is M0). Then, CS tests whether bsN0 & [CMDRM0] = bsN0 , where & denotes bitwise AND operation. If bsN0 & [CMDRM0] = bsN0 , CS continues to test bsN1 and [CMDRM1] , where [CMDRM1] is the secure enhanced MDR code of M1 (suppose M1 is a child node of M0). By using the method iteratively, if PPA is legal, CS can find the node A in SEITI by using PPA .

(2) CS tests the child nodes of A with the root node of TSRCH' in turn. Suppose B is a child node of A and B' is the root node of TSRCH' , CS tests whether [CdataB'] & [CMDRB] = [CdataB']. If the equation holds, it means that the node B in SEITI corresponds to the node B' in TSRCH' . Then, CS tests each child node of B with each child node of B' by using the above method iteratively. Finally, CS can find all the nodes in SEITI which correspond to the leaf nodes in TSRCH' . As the set of all the leaf nodes in TSRCH' is the minimal cover of MDRA ∩ MDRQ (please refer to Section 6), these corresponding nodes in SEITI cover all the ciphertexts covered by MDRA ∩ MDRQ . Finally, CS returns all the ciphertexts under these corresponding nodes in SEITI to u as the search results.

Note that, if the distribution of the outsourced data is approximately uniform, our scheme can effectively handle the range query. In light of security considerations, DO can employ data padding method or data compression techniques to ensure that the sizes of encrypted multi-dimensional data under leaf nodes are the same. Namely, the leaf nodes of SEITI cannot be distinguished according to the volumes of ciphertexts stored under them.

Data Update. If a user u wants to update some data indexed by SEITI, he/she should first obtain the update authorization from DO. Then, u generates a search token to find the leaf nodes of SEITI which covers the data that is needed to be updated. With the assistance of CS, the data that is needed to be updated is replaced with the new data.

8. Experiment

In our experiments, we conduct a comparative analysis of our scheme with [24] and [31]. In [24], DO builds an Efficient Interval Tree (EIT) as index, which is constructed based on Asymmetric Scalar-product Preserving Encryption with Noise (ASPEN[32]). ASPEN consists of many matrix calculations, which are implemented by using Jama Library version 1.0.3. To ensure the security of ASPEN, the dimensionality of matrixes is extended to 100 by using extra artificial dimensions[23, 24, 32]. In [31], DO proposes a multi-dimensional data order preserving encryption method (MDOPE). MDOPE utilizes a network data structure to efficiently organize multi-dimensional data. It employs prefix encoding and bloom filter techniques to process the values stored in the network data structure, thereby enabling queries on encrypted multi-dimensional data.

Our experiments are conducted on a Windows 10 computer with an AMD Ryzen 5 2500U CPU and 8GB of RAM. We generate a collection consisting of 2.5 ×105 data entries. These data entries are evenly distributed within the 2-dimensional range (0, 210] × (0,310] . Over the range, we build EIT, MDOPE and SEITI separately. In EIT and SEITI, the division parameters for the first and second dimensions are set to 2 and 3 respectively. Thus, the fan-outs of EIT and SEITI are 6 ( 6 = 2 × 3 ). In MDOPE, each node on the first dimension contains only one split data, while on the second dimension, each node contains two split data. As a result, the ranges on the first and second dimensions are divided into two and three smaller ranges respectively. Consequently, the fan-out of MDOPE is 5.

Index Construction. Fig. 4 shows the construction times of EIT and SEITI which increase exponentially. The construction time of MDOPE increase linearly. As EIT (and SEITI) is tree structure, the total number nodes in EIT (and SEITI) increases exponentially with the height of EIT (and SEITI). Thus, the construction time of EIT (and SEITI) increases exponentially. In EIT, the MDR of each node is encrypted by using a matrix. In SEITI, the MDR of each node is processed by one-way hash function and bloom filter. As one-way hash function and bloom filter are more efficient than matrix calculation, the index construction of SEITI is more efficient than that of EIT. In MDOPE, the split data in each index node only needs to be converted to a binary code, then padded with a random binary code, and finally processed by a bloom filter. Thus, the calculation is very efficient. The primary time overhead lies in inserting the encrypted multi-dimensional data into the index. Thus, when the height of the index increases (the amount of encrypted multi-dimensional data is fixed), the time of index construction grows slowly.

E1KOBZ_2024_v18n9_2717_14_f0001.png 이미지

Fig. 4. Index Construction.

Privilege Path Generation. As shown in Fig. 5, the generations of privilege paths in EIT and SEITI increase linearly and are very efficient. Compared with EIT, the privilege path generation in SEITI is more efficient. When the height of EIT (and SEITI) increases, the average length of these privilege paths increases linearly. Thus, the average time for generating these privilege paths also increases linearly. In EIT, all the values in a privilege path should be encrypted by using matrices. In SEITI, all the values in a privilege path are randomly chosen according to the secure enhanced MDR codes stored in the nodes of SEITI. Therefore, due to the efficiency of the underlying calculation method, the privilege path generation in SEITI is more efficient than in EIT. Compared with EIT and SEITI, as MDOPE does not consider the scenario of multiple users, there is no privilege path generation in MDOPE.

E1KOBZ_2024_v18n9_2717_14_f0002.png 이미지

Fig. 5. Privilege Path Generation.

Search Token Generation. Fig. 6 shows the search token generation in MDOPE which is very inefficient. The average time of search token generation increases with the height of EIT (and SEITI). Compared with MDOPE and EIT, the search token generation in SEITI is more efficient. In EIT (and SEITI), the search token is constructed according to the minimal cover of a queried range. When the number of elements in the minimal cover increases, a user should take more time to process these elements. Thus, it takes more time to generate the search token for a queried range. In the experiments, we randomly generate thousands of search tokens for queried ranges. When the height of EIT (and SEITI) increases, the average size of these queried ranges increases. This results that the average number of elements in the minimal covers of the queried ranges increases. Thus, the average time of search token generation increases with the height of EIT (and SEITI). In SEITI, each value in the search token is encrypted by a matrix. In EIT, each value in the search token is processed by using one-way hash function and bloom filter. As one-way hash function and bloom filter are more efficient than matrix calculations, the search token generation in SEITI is more efficient than EIT. In MDOPE, a queried range is first transformed into a l-length bit string. Then, the l-length bit string is padded by a 2-length bit string for comparison purpose, and a l-length random bit string for security purpose. Thus, the queried range is transformed into a (2l + 2)-length bit string. Next, the (2l + 2)-length bit string is processed by using prefix encoding. This process is very inefficient, because the (2l + 2)-length bit string implies that there are 22l+2 different bit strings which should be merged one by one to finally form only a few bit strings. Thus, the search token generation in MDOPE is very inefficient.

E1KOBZ_2024_v18n9_2717_14_f0003.png 이미지

Fig. 6. Search Token Generation.

Range query. The SEITI scheme utilizes a secure index to achieve efficient querying. In SEITI, the secure index is a tree structure where leaf nodes are associated with MDRs. The data within an MDR is encrypted as a unit. As the total amount of data increases, the amount of data in MDRs associated with leaf nodes also increases. During the querying process, the search token is compared only with the nodes of the secure index. Ultimately, the MDRs associated with the leaf nodes that intersect the query range are located, and the encrypted data within them is returned as the query result. Consequently, when the height of the secure index is fixed, an increase in the total amount of data does not affect the query efficiency of SEITI. Therefore, in our experiments, we keep the total amount of data unchanged and test the query efficiency by varying the height of the secure index. Fig. 7 shows the range query in SEITI which is more efficient than EIT and MDOPE. In EIT (and SEITI), a search token consists of a privilege path and a search tree. CS should find some nodes of EIT (and SEITI) which correspond to the nodes in the privilege path and the search tree according to the values stored in these nodes. Thus, if there are more nodes in a search token, it requires more calculations for CS to find the corresponding nodes in EIT (and SEITI). When the height of EIT (and SEITI) increases, the average amount of nodes in the search tokens increases, resulting that the average time of range query in EIT (and SEITI) increases. As (i) EIT and SEITI have similar tree structure and (ii) the search tokens in EIT and SEITI are both constructed by using the minimal covers of queried ranges, the average number of the nodes in search tokens are nearly the same. However, CS needs to execute multiple matrix calculations in EIT. In SEITI, CS only needs to execute bitwise AND calculations. Thus, the range query of SEITI is more efficient than that of EIT. In MDOPE, CS is unable to accurately determine whether an index node satisfies a search token. Therefore, CS may be required to tests the search token with the major split values in an index node and also the major child nodes of the index node. Therefore, if the height of index increases, the number of these split values and index nodes (that need to be tested) increases exponentially. Thus, the time of range query in MDOPE increases exponentially with the height of index. In summary, the range query in SEITI is more efficient than EIT and MDOPE.

E1KOBZ_2024_v18n9_2717_14_f0004.png 이미지

Fig. 7. Range query.

Data update and privilege revocation. We only test the data update and privilege revocation in ourscheme because both the EIT scheme and the MDOPE scheme do not provide data update and privilege revocation. SEITI supports efficient queries, which enables the cloud server to fast locate the data which should be updated and then replace the data with the new data. Thus, the efficiency of data update is related to its query efficiency. Consequently, as shown in Fig. 8, the efficiency of data update is also very efficient. To revoke the privileges of users, CS should update the index of SEITI, then regenerate and distribute new privilege paths to Us whose privileges have not been revoked. Thus, we test the efficiency of index update in SEITI and privilege path regeneration. As shown in Fig. 9, because the total number of index nodes increases exponentially with the index height, the time for updating the index increases exponentially. In SEITI, the computation overhead for regenerating a privilege path is very low. Thus, the regeneration of privilege path is very efficient. In our experiments, we only give the average time for regenerating a single privilege path. However, in practical applications, if the privileges of some users are revoked, DO needs to regenerate new privilege paths for the remaining users. Therefore, in multi-user scenarios, the time required for regenerating privilege paths depends on the total number of users whose privileges have not been revoked.

E1KOBZ_2024_v18n9_2717_16_f0001.png 이미지

Fig. 8. Data update.

E1KOBZ_2024_v18n9_2717_16_f0002.png 이미지

Fig. 9. Privilege Revocation.

9. Security Analysis of SEITI

In this section, we first analyze the security of the index and the ciphertexts of multi-dimensional data stored in CS. Then, we proceed to analyze the security of the range query.

Theorem 1. The proposed SEITI scheme is secure in terms of the leak function F .

Proof 1. In SEITI, the multi-dimensional data is encrypted with a secure encryption scheme. Thus, the security of all the multi-dimensional data is guaranteed by the security of the encryption scheme. Given an MDR of an index node and a data of a search tree node (in the search token), their encoding lengths are both l (please refer to Section 4 and 6). The MDR code and the data code are then processed by a one-way hash function (e.g. SHA families) with privilege values (privilege values are pseudo-random numbers). The numbers of the output elements from the one-way hash function are the same. Then, these elements are as the inputs of the bloom filter. Suppose two inputs of the bloom filter are a = < a1,a2,…,an> and b = <b1,b2,…,bn> respectively and there are m elements in a and b are the same, it can deduce that a1 = b1, a2 = b2, …, am = bm , am+1 ≠ bm+1 , …, an ≠ bn . According to the outputs of the bloom filter, an adversary only knows the leakage function F(mi, mj) = positiondiff(mi, mj). In summary, our scheme SEITI is secure with respect to the leakage function F .

Known Ciphertext Model: Adversaries can only access the ciphertexts of multi-dimensional data, the index SEITI, the submitted search tokens and the retrieved ciphertexts. According to the known ciphertext model, if an adversary records the search tokens and the retrieved ciphertexts, the adversary can obtain access patterns. Therefore, nothing beyond the access pattern should be leaked in the known ciphertext model. We adapt the definitions in [29, 30] for our proofs.

Definition 3. Search Pattern ( SP ): Let Q = {R1,…,Rm} be the set of m consecutive queried ranges, SP be a m × m binary matrix s.t. if MCRi = MCRj , then SP[i, j] = 1 . Otherwise, SP[i, j] = 0 (i, j = 1,… ,m, MCRi and MCRj are the minimal covers of Ri and Rj respectively).

Definition 4. Access Pattern ( AP ). Let Q = {R1,…,Rm} be the set of m consecutive queried ranges, T = {TR1,…, TRm} be the search tokens for Q = {R1,…, Rm} , C = {C1,…,Cm} be the retrieved ciphertext sets for T = {TR1,…,TRm}. The access pattern for Q is defined as AP = {AP(TR1) = C1,…, AP(TRm) = Cm} .

Definition 5. History ( Hm ). Let Q = {R1,…,Rm} be the set of m consecutive queried ranges and C = {C1,…,Cm} be the retrieved ciphertext sets for Q = {R1,…,Rm} . Then, Hm = (Q, C) is defined as a m-query history.

Definition 6. Trace ( γ ): Let Ap(Hm) be the access pattern of Hm , T = {TR1,…,TRm} be the search tokens for Q = {R1,…,Rm} , C = {C1,…,Cm} be the retrieved ciphertext sets for Q = {R1,…,Rm} . Then, the trace of Hm is defined as γ(Hm) = {AP(Hm), {TR1,…,TRm}, {C1,…,Cm}} .

Definition 7. View ( V ): Let SEITI be the index, ℂ be all the ciphertexts under SEITI , T = {TR1,…,TRm} be the search tokens for Q = {R1,…,Rm} , C = {C1,…,Cm} be the retrieved ciphertext sets for Q = {R1,…,Rm} . Then V(Hm) = {ℂ, SEITI, {TR1,…,TRm}, {C1,…,Cm}} is defined as the view of Hm . V(Hm) is the information that is accessible to an adversary.

We employ a simulator-based proof method similar to the one widely used in [29, 30]. In essence, if the adversary cannot differentiate between two histories with the same trace, it implies that the adversary cannot gain any additional information about the ciphertexts, the index, the search tokens and the query results[29, 30].

Theorem 2. Our scheme is secure in the known ciphertext model.

Proof 2. The notation S denotes the simulator which can simulate a view V(Hm)* . From an adversary's view, V(Hm)* and V(Hm) = {ℂ, SEITI, {TR1,…,TRm}, {C1,…,Cm}} exhibit computational indistinguishability. To achieve this, the simulator S does the followings:

(1) Multi-dimensional data are encrypted by using a secure encryption method. The ciphertexts output by the secure encryption method are computationally indistinguishable from random values. The set of these ciphertexts is ℂ which is available in SEITI . S randomly chooses |ℂ| values as ℂ*. Thus, ℂ* and ℂ exhibit computational indistinguishability.

(2) The privilege value pv of the root node in SEITI is randomly chosen. The privilege value of each node in SEITI is calculated by using pv and the hash function H . Thus, the privilege value of each node can also be seen as randomly chosen. Then, for each node in SEITI , its security enhanced MDR code [CMDR] is calculated by using its privilege value, one-way hash function and bloom filter. As [CMDR] is calculated by using its privilege value, [CMDR] can also be seen as a binary string in which each bit is randomly set to 0 or 1. As the index SEITI is available, S constructs an index SEITI* s.t. (i) The structures of SEITI and SEITI* are the same; (ii) Each node of SEITI* is associated with a binary string in which each bit is randomly set to 0 or 1; (iii) The lengths of the binary strings in the nodes of SEITI and SEITI* are the same. Thus, SEITI* and SEITI exhibit computational indistinguishability.

(3) The search token TRi (i = 1,2,…,m) is available. TRi consists of a privilege path PP and a tree TSRCH' . S constructs a search token TRi* by using the next two steps. First, S constructs a privilege path PP*. The lengths of PP and PP* are the same. Additionally, as (i) the binary strings in the nodes of SEITI* and SEITI are computationally indistinguishable and (ii) the values in PP* and PP are generated according to the binary strings in the nodes of SEITI* and SEITI , the values in PP and PP* are computationally indistinguishable. Thus, PP and PP* exhibit computational indistinguishability. Second, S constructs a tree TSRCH'* . TSRCH' and TSRCH'* have the same structure. S finds a subtree T* in * SEITI* : (i) T* and TSRCH'* have the same structure; (ii) Each node in TSRCH'* corresponds to a node in T* . For each node in TSRCH'* , S randomly sets its binary string to a s.t. a & b = a , where a is the binary string in the node of TSRCH'* and b is the binary string in the corresponding node of T* (& denotes bitwise AND operation). As the binary strings in the nodes of TSRCH' are generated according to the privilege values and these privilege values can be seen as randomly chosen, these binary strings can be seen as randomly chosen. As the binary strings in the nodes of TSRCH'* are also generated according to the privilege values and these privilege values can be seen as randomly chosen, these binary strings can also be seen as randomly chosen. As (i) the binary strings in the nodes of TSRCH' and TSRCH'* are computationally indistinguishable and (ii) TSRCH' and TSRCH'* have the same structure, TSRCH' and TSRCH'* are computationally indistinguishable. As PP and TSRCH' in TRi are computationally indistinguishable from PP* and TRi* respectively (i = 1,…,m), the search tokens {TR1,…,TRm} and {TR1*,…,TRm*} exhibit computational indistinguishability.

(4) S generates {C1*,…,Cm*} , where Ci* ( i = 1,…,m ) denotes an empty set. The ciphertext set ℂ and the set of retrieved ciphertext result sets {C1,…,Cm} are available. For each ciphertext c∈ℂ , if c ∈ C1 , S chooses a ciphertext c* in ℂ* and then adds c* to C1* . Next, S judges whether c ∈ C2 . If c ∈ C2 , S also adds c* to C2* . By using the method iteratively, S can obtain {C1*,…,Cm*} . It is obviously that |Ci|=|Ci*| . As (i) |Ci|=|Ci*| and (ii) the ciphertexts in ℂ and ℂ* are computationally indistinguishable, {C1,…,Cm} and {C1*,…,Cm*} exhibit computational indistinguishability.

Given that each element of V(Hm) and V(Hm)* exhibit computational indistinguishability, we can conclude that our scheme meets the security definition outlined in Theorem 2.

10. Conclusion and Discussion

In this work, we propose SEITI. SEITI can be used to perform range queries on the ciphertexts of multi-dimensional data in the scenario of multiple users. Compared with order-preserving encryption schemes[21, 22], as the ciphertexts in the same MDR are computationally indistinguishable, SEITI can effectively preserve the ordering of ciphertexts. Compared with bucketization methods[7, 8, 9, 10], as the privilege values can only calculated by using one-way hash function, SEITI can support multiple users by distributing different privilege values to different users. Compared with public-key based ciphertexts search methods[11, 12, 13], as the bitwise AND operations are used, the range query in SEITI is more efficient. Compared with bloom filter based methods[15,16,17,18], SEITI can support multiple users to perform range queries over ciphertexts. SEITI also exists some drawbacks, including the absence of dynamic data update, secret parameter management and efficient privilege revocation. In future endeavors, we will prioritize addressing these critical issues.

Acknowledgments

This work is supported by the National Natural Science Foundation of China (No. 61962029, 62262033, 62062045, 62341206), the Zhejiang Province Visiting Engineer Cooperation Project (No. FG2023061).

References

  1. S. Zhang, L. T. Yang, J. Feng et al., "A tensor-network-based big data fusion framework for Cyber-Physical-Social Systems (CPSS)," Information Fusion, vol.76, pp.337-354, 2021.
  2. Z. Mei, B. Wu, S. Tian et al., "Fuzzy Keyword Search Method over Ciphertexts supporting Access Control," KSII Transactions on Internet and Information Systems (TIIS), vol.11, no.11, pp.5671-5693, 2017.
  3. J. Feng, L. T. Yang, Q. Zhu, and K.-K. R. Choo, "Privacy-Preserving Tensor Decomposition Over Encrypted Data in a Federated Cloud Environment," IEEE Transactions on Dependable and Secure Computing, vol.17, no.4, pp.857-868, 2020.
  4. J. Feng, L. T. Yang, R. Zhang, W. Qiang, and J. Chen, "Privacy Preserving High-Order Bi-Lanczos in Cloud-Fog Computing for Industrial Applications," IEEE Transactions on Industrial Informatics, vol.18, no.10, pp.7009-7018, 2022.
  5. Z. Wu, R. Wang, Q. Li, X. Lian, G. Xu, E. Chen, X. Liu, "A Location Privacy-Preserving System Based on Query Range Cover-Up or Location-Based Services," IEEE Transactions on Vehicular Technology, vol.69, no.5, pp.5244-5254, 2020.
  6. D. Sharma, "Searchable encryption : A survey," Information Security Journal: A Global Perspective, vol.32, no.2, pp.76-119, 2023.
  7. B. Hore, S. Mehrotra, M. Canim, and M. Kantarcioglu, "Secure multidimensional range queries over outsourced data," The VLDB Journal, vol.21, pp.333-358, 2012.
  8. P. Wang, C. V. Ravishankar, "Secure and efficient range queries on outsourced databases using Rp-trees," in Proc. of 2013 IEEE 29th International Conference on Data Engineering (ICDE), pp.314-325, 2013.
  9. Y. Lee, "Secure Ordered Bucketization," IEEE Transactions on Dependable and Secure Computing, vol.11, no.3, pp.292-303, 2014.
  10. W. Lin et al., "Towards Secure and Efficient Equality Conjunction Search Over Outsourced Databases," IEEE Transactions on Cloud Computing, vol.10, no.2, pp.1445-1461, 2022.
  11. Y. Lu, "Privacy-Preserving Logarithmic-time Search on Encrypted Data in Cloud," in Proc. of NDSS Symposium 2012, 2012.
  12. Z. Mei et al., "Secure multi-dimensional data retrieval with access control and range query in the cloud," Information Systems, vol.122, 2024.
  13. H. Qiao et al., "Ciphertext Range Query Scheme Against Agent Transfer and Permission Extension Attacks for Cloud Computing," IEEE Internet of Things Journal, vol.11, no.10, pp.17975-17988, 2024.
  14. B. H. Bloom, "Space/time trade-offs in hash coding with allowable errors," Communications of the ACM, vol.13, no.7, pp.422-426, 1970.
  15. R. Li, A. X. Liu, A. L. Wang, and B. Bruhadeshwar, "Fast and Scalable Range Query Processing With Strong Privacy Protection for Cloud Computing," IEEE/ACM Transactions on Networking, vol.24, no.4, pp.2305-2318, 2016.
  16. B. Wang, M. Li, and H. Wang, "Geometric Range Search on Encrypted Spatial Data," IEEE Transactions on Information Forensics and Security, vol.11, no.4, pp.704-719, 2016.
  17. N. Cui, J. Li, X. Yang, B. Wang, M. Reynolds, and Y. Xiang, "When Geo-Text Meets Security: Privacy-Preserving Boolean Spatial Keyword Queries," in Proc. of 2019 IEEE 35th International Conference on Data Engineering (ICDE), pp.1046-1057, 2019.
  18. H. Mahdikhani et al., "Achieving Efficient and Privacy-Preserving Range Query in Fog-enhanced IoT with Bloom Filter," in Proc. of ICC 2020 - 2020 IEEE International Conference on Communications (ICC), pp.1-6, 2020.
  19. C. Zuo, S.-F. Sun, J. K. Liu, J. Shao and J. Pieprzyk, "Dynamic Searchable Symmetric Encryption Schemes Supporting Range Queries with Forward (and Backward) Security," in Proc. of 23rd European Symposium on Research in Computer Security, ESORICS 2018, LNSC, vol.11099, pp.228-246, 2018.
  20. J. Wang, and S. S. M. Chow, "Forward and Backward-Secure Range-Searchable Symmetric Encryption," in Proc. of Privacy Enhancing Technologies Symposium, vol.2022, no.1, pp.28-48, 2022.
  21. R. A. Popa, F. H. Li, and N. Zeldovich, "An Ideal-Security Protocol for Order-Preserving Encoding," in Proc. of 2013 IEEE Symposium on Security and Privacy, pp.463-477, 2013.
  22. F. Kerschbaum, and A. Schroepfer, "Optimal Average-Complexity Ideal-Security Order-Preserving Encryption," in Proc. of the 2014 ACM SIGSAC Conference on Computer and Communications Security, pp.275-286, 2014.
  23. W. K. Wong, D. W.-l. Cheung, B. Kao, and N. Mamoulis, "Secure kNN computation on encrypted databases," in Proc. of the 2009 ACM SIGMOD International Conference on Management of data, pp.139-152, 2009.
  24. Z. Mei, H. Zhu, Z. Cui, Z. Wu, G. Peng, B. Wu, and C. Zhang, "Executing multi-dimensional range query efficiently and flexibly over outsourced ciphertexts in the cloud," Information Sciences, vol.432, pp.79-96, 2018.
  25. L. Chen, G. Cong, C. S. Jensen, and D. Wu, "Spatial keyword query processing: an experimental evaluation," in Proc. of the VLDB Endowment, vol.6, no.3, pp.217-228, 2013.
  26. B. Wang, M. Li, and L. Xiong, "FastGeo: Efficient Geometric Range Queries on Encrypted Spatial Data," IEEE Transactions on Dependable and Secure Computing, vol.16, no.2, pp.245-258, 2019.
  27. S. Faber, S. Jarecki, H. Krawczyk, Q. Nguyen, M. Rosu, and M. Steiner, "Rich Queries on Encrypted Data: Beyond Exact Matches," in Proc. of 20th European Symposium on Research in Computer Security, LNSC, vol.9327, pp.123-145, 2015.
  28. I. Demertzis, S. Papadopoulos, O. Papapetrou, A. Deligiannakis, and M. Garofalakis, "Practical Private Range Search Revisited," in Proc. of the 2016 International Conference on Management of Data (SIGMOD 2016), pp.185-198, 2016.
  29. M. Kuzu, M. S. Islam, and M. Kantarcioglu, "Efficient Similarity Search over Encrypted Data," in Proc. of 2012 IEEE 28th International Conference on Data Engineering, pp.1156-1167, 2012.
  30. B. Wang, S. Yu, W. Lou, and Y. T. Hou, "Privacy-preserving multi-keyword fuzzy search over encrypted data in the cloud," in Proc. of IEEE INFOCOM 2014 - IEEE Conference on Computer Communications, pp.2112-2120, 2014.
  31. Y. Zhan, D. Shen, P. Duan et al., "MDOPE: Efficient multi-dimensional data order preserving encryption scheme," Information Sciences, vol.595, pp.334-343, 2022.
  32. S. Su, Y. Teng, X. Cheng, K. Xiao, G. Li, and J. Chen, "Privacy-Preserving Top-k Spatial Keyword Queries in Untrusted Cloud Environments," IEEE Transactions on Services Computing, vol.11, no.5, pp.796-809, 2018.