Data Dissemination
Data Dissemination
Data Dissemination
26
Pull-based mechanisms function in the
following manner:
1. The bandwidth used for the uplink channel depends upon the
number of pull requests.
2. A pull threshold is selected. This threshold limits the number of
pull requests in a given period of time. This controls the number
of server interruptions.
3. A mechanism is adopted to prevent the device from pulling from
a cell, which has handed over the concerned device to another
cell. On device handoff, the subscription is cancelled or passed on
to the new service provider cell
• Disadvantages:
– The server faces frequent interruptions and queues of
requests at the server may cause congestion in cases of sudden
rise in demand for certain data record.
– In on-demand mode, another disadvantage is the energy and
bandwidth required for sending the requests for hot items and
temporal records
Hybrid Mechanisms
• A hybrid data-delivery mechanism integrates pushes and pulls. The
hybrid mechanism is also known as interleaved-push-and-pull (IPP)
mechanism.
• The devices use the back channel to send pull requests for records,
which are not regularly pushed by the front channel.
• The front channel uses algorithms modeled as broadcast disks and
sends the generated interleaved responses to the pull requests.
• The user device or computing system pulls as well receives the
pushes of the data records from the service provider's application
server or database server or from a set of distributed computing
systems.
• Best example would be a system for advertising and selling
music albums. The advertisements are pushed and the mobile
devices pull for buying the album.
230
Hybrid interleaved push-pull-based data-delivery mechanism 231
• The above figure shows a hybrid
interleaved, push-pull-based data-delivery
mechanism in which a device pulls
(demands) from a server and the server
interleaves the responses along with the
pushes of the data records generated by a
set of distributed computing systems.
232
Hybrid mechanisms function in the
following manner:
1. There are two channels, one for pushes by front channel and the other
for pulls by back channel.
2. Bandwidth is shared and adapted between the two channels depending
upon the number of active devices receiving data from the server and the
number of devices requesting data pulls from the server.
3. An algorithm can adaptively chop the slowest level of the scheduled
pushes successively The data records at lower level where the records are
assigned lower priorities can have long push intervals in a broadcasting
model.
Advantages of Hybrid mechanisms:
– The number of server interruptions and queued requests are significantly
reduced.
Disadvantages:
– IPP does not eliminate the typical server problems of too many
interruptions
and queued requests.
– Another disadvantage is that adaptive chopping of the slowest level of
scheduled pushes. 233
Selective Tuning and Indexing
Techniques
• The purpose of pushing and adapting to a broadcast
model is to push records of greater interest with greater
frequency in order to reduce access time or average
access latency.
• A mobile device does not have sufficient energy to
continuously cache the broadcast records and hoard
them in its memory.
• A device has to dissipate more power if it gets each
pushed item and caches it.
• Therefore, it should be activated for listening and
caching only when it is going to receive the selected data
records or buckets of interest.
• During remaining time intervals, that is, when the
broadcast data buckets or records are not of its interest,
it switches to idle or power down mode.
234
• Selective tuning is a process by which client device selects only the
required pushed buckets or records, tunes to them, and caches
them.
• Tuning means getting ready for caching at those instants and
intervals when a selected record of interest broadcasts. Broadcast
data has a structure and overhead.
• Data broadcast from server, which is organized into buckets, is
interleaved. The server prefixes a directory, hash parameter (from
which the device finds the key), or index to the buckets.
• These prefixes form the basis of different methods of selective
tuning. Access time (taccess) is the time interval between pull
request from device and reception of response from
broadcasting or data pushing or responding server. Two
important factors affect taccess –
– (i) number and size of the records to be broadcast and
– (ii) directory- or cache-miss factor (if there is a miss then
the response from the server can be received only in subsequent
broadcast cycle or subsequent repeat broadcast in the cycle).
235
Directory Method
• One of the methods for selective tuning involves
broadcasting a directory as overhead at the beginning of
each broadcast cycle.
• If the interval between the start of the broadcast
cycles is T, then directory is broadcast at each
successive intervals of T.
• A directory can be provided which specifies when a
specific record or data item appears in data being
broadcasted.
• For example, a directory (at header of the cycle) consists
of directory start sign, 10, 20, 52, directory end sign.
• It means that after the directory end sign, the 10th,
20th and 52nd buckets contain the data items in
response to the device request. The device selectively
tunes to these buckets from the broadcast data.
236
• A device has to wait for directory consisting of start sign,
pointers for locating buckets or records, and end sign.
• Then it has to wait for the required bucket or record
before it can get tuned to it and, start caching it.
• Tuning time ttune is the time taken by the device for
selection of records.
• This includes the time lapse before the device starts
receiving data from the server. In other words, it is the
sum of three periods—time spent in listening to the
directory signs and pointers for the record in order to
select a bucket or record required by the device, waiting
for the buckets of interest while actively listening (getting
the incoming record wirelessly), and caching the
broadcast data record or bucket.
237
• The device selectively tunes to the
broadcast data to download the records of
interest.
• When a directory is broadcast along with the
data records, it minimizes ttune and
taccess.
• The device saves energy by remaining active
just for the periods of caching the directory
and the data buckets.
• For rest of the period (between directory end
sign and start of the required bucket), it
remains idle or performs application tasks.
Without the use of directory for tuning, ttune =
taccess and the device is not idle during any 238
Hash-Based Method
• Hash is a result of operations on a pair of key and record.
• Advantage of broadcasting a hash is that it contains a fewer bits
compared to key and record separately.
• The operations are done by a hashing function. From the server end
the hash is broadcasted and from the device end a key is extracted
by computations from the data in the record by operating the data
with a function called hash function (algorithm).
• This key is called hash key.
• Hash-based method entails that the hash for the hashing parameter
(hash key) is broadcasted.
• Each device receives it and tunes to the record as per the extracted
key.
• In this method, the records that are of interest to a device or those
required by it are cached from the broadcast cycle by first extracting
and identifying the hash key which provides the location of the
record.
239
This helps in tuning of the device. Hash-based method can be
described as follows:
1. A separate directory is not broadcast as overhead with each
broadcast cycle.
2. Each broadcast cycle has hash bits for the hash function H, a shift function
S, and the data that it holds. The function S specifies the location of the
record or remaining part of the record relative to the location of hash and,
thus, the time interval for wait before the record can be tuned and cached.
3. Assume that a broadcast cycle pushes the hashing parameters H(R ) [H and
í
S] and record R . The functions H and S help in tuning to the H(R ) and hence
í í
to R as follows—H gives a key which in turn gives the location of H(R ) in the
í í
broadcast data. In case H generates a key that does not provide the location
of H(R ) by itself, then the device computes the location from S after the
í
location of H(R ). That location has the sequential records R and the devices
í í
240
Index-Based Method
• Indexing is another method for selective tuning. Indexes
temporarily map the location of the buckets.
• At each location, besides the bits for the bucket in
record of interest data, an offset value may also be
specified there.
• While an index maps to the absolute location from
the beginning of a broadcast cycle, an offset index is a
number which maps to the relative location after the end
of present bucket of interest.
• Offset means a value to be used by the device along
with the present location and calculate the wait period for
tuning to the next bucket. All buckets have an offset to
the beginning of the next indexed bucket or item.
241
• Indexing is a technique in which each data
bucket, record, or record block of interest is
assigned an index at the previous data bucket,
record, or record block of interest to enable
the device to tune and cache the bucket after
the wait as per the offset value.
• The server transmits this index at the beginning
of a broadcast cycle as well as with each bucket
corresponding to data of interest to the device.
• A disadvantage of using index is that it
extends the broadcast cycle and hence
increases taccess.
242
• The index I has several offsets and the bucket type and flag information. A
typical index may consist of the following:
1. Ioffset(1) which defines the offset to first bucket of nearest index.
2. Additional information about Tb, which is the time required for caching the bucket
bits in full after the device tunes to and starts caching the bucket. This enables
transmission of buckets of variable lengths.
3. Ioffset (next) which is the index offset of next bucket record of interest.
4. Ioffset(end) which is the index offset for the end of broadcast cycle and the start of next
cycle. This enables the device to look for next index I after the time interval as per
Ioffset(end). This also permits a broadcast cycle to consist of variable number of buckets.
5. Itype , which provides the specification of the type of contents of next bucket to be tuned,
that is, whether it has an index value or data.
6. A flag called dirty flag which contains the information whether the indexed buckets
defined by Ioffset(1) and Ioffset(next) are dirty or not. An indexed bucket being dirty
means that it has been rewritten at the server with new values. Therefore, the device
should invalidate the previous caches of these buckets and update them by tuning to
and caching them.
243
Distributed Index Based
Method
• Distributed index-based method is an improvement on
the (I, m) method.
• In this method, there is no need to repeat the
complete
index again and again.
• Instead of replicating the whole index m times, each
index segment in a bucket describes only the offset I'
of data items which immediately follow. Each index I is
partitioned into two parts—I' and I".
• I" consists of unrepeated k levels (sub-indexes),
which do not repeat and I' consists of top I repeated
levels (sub-indexes).
• Assume that a device misses I(includes I' and I' once)
transmitted at the beginning of the broadcast cycle. As I'
is repeated m - I times after this, it tunes to the pushes
lesser
by usinglevels.
I', The access latency is reduced as I' has 244
Flexible Indexing Method
• Assume that a broadcast cycle has number of data segments with each of
the segments having a variable set of records. For example, let n records,
Ro to Rn-1, be present in four data segments, R() to Ri-1, Ri to Rj-1 , Rj to
Rj-1 and Rk to Rn-1.
• Some possible index parameters are (i) Iseg,having just 2 bits for
the offset, to specify the location of a segment in a broadcast cycle,
(ii) Irec, having just 6 bits for the offset, to specify the location of a record
of interest within a segment of the broadcast cycle, (iii) Ib, having just 4
bits for the offset, to specify the location of a bucket of interest within a
record present in one of the segments of the broadcast cycle.
• Flexible indexing method provides dual use of the parameters (e.g.,
use of Iseg or Irec in an index segment to tune to the record or buckets of
interest) or multi-parameter indexing (e.g., use of Iseg, Irec, or Ib in an
index segment to tune to the bucket of interest).
245
• Assume that broadcast cycle has m sets of
records (called segments). A set of binary bits
defines the index parameter Iseg,. A local index
is then assigned to the specific record (or
bucket). Only local index (Irec or Ib) is used in
(Iloc, m) based data tuning which corresponds
to the case of flexible indexing method being
discussed. The number of bits in a local index is
much smaller than that required when each
record is assigned an index. Therefore, the
flexible indexing method proves to be beneficial.
246