Page 87 - ITU Journal Future and evolving technologies Volume 2 (2021), Issue 7 – Terahertz communications
P. 87

ITU Journal on Future and Evolving Technologies, Volume 2 (2021), Issue 7





              1400                                              140
                     Exp3 10 dB                                        HBA SNR=20dB
                     Exp3 0 dB                                         HBA SNR=10dB
              1200   HBA 0 dB                                   120    Exp3 SNR=10dB
                     HBA 10 dB                                         Exp3 SNR=20dB
              1000                                              100
             Cumulative regret  800                            Cumulative regret  80

              600
                                                                 60
              400                                                40

              200                                                20
                0                                                0
                 0        50       100      150       200         0        50       100       150      200
                                 Time slots                                       Time slots
             Fig. 7 – Cumulative regret of different algorithms in LOS scenario.  Fig. 8 – Cumulative regret of different algorithms in NLOS scenario.


          receiver as well. This scheme is related to a performance   over the entire codebook is avoided in HBA. Meanwhile,
          lower bound.                                         Exp3 operates like a random searching in the beginning,
          Exhaustive  search:  Exhaustive  search  is  a  naive  beam   which results in a large number of time slots to converge.
          alignment  approach.  In  this  scheme,  the  transmitter
          applies   all   beam   codes   from   the   prede ined   5.2 Convergence behavior
          codebook  several  times  to  obtain  the  rewards  of  all
          beam  codes  [9].  Then  the  beam  code  with  the
                                                                     140
          largest  measured  reward  is  selected  for  usage.  This
          beamforming  method  ensures  that  the  optimal  beam     120
          code  from  the  available codebook  is  always  obtained.
          Its  performance  serves for  quantifying  the  loss  due  to   100
          the  beam  misalignment caused by the HBA.
          Exponential  weights  (Exp3)  algorithm:   The  Exp3       Nnumber of time slots untill convergence  80
          algorithm  is  based  on  the  adversarial  MAB  framework.   60                          LOS HBA
          In  [16],  the  authors  advocated  applying  the  Exp3                                   NLOS HBA
                                                                                                    LOS Exp3
          algorithm to  the  beam  alignment  problem.  Compared  to   40                           NLOS Exp3
          HBA,  the Exp3  algorithm  does  not  take  the  hierarchical   20
                                                                      0   2  4   6  8   10  12  14  16  18  20
          structure of the codebook into account, which results in a                  SNR (dB)
          slow convergence  behavior  for  the  beam  code  selection.
                                                               Fig. 9 – Convergence behavior of different BA schemes in both LOS and
          Its  convergence behavior can be taken as a reference for
                                                               NLOS scenario.
          the convergence behavior of the HBA.
                                                               Figure 9 illustrates the convergence behavior of the HBA
          5.1  Cumulative regret                               in both LOS and NLOS scenario.  In high SNR conditions,
                                                               only 30 time slots are required for convergence to the  i‑
          Figures  7  and  8  depict  the  sum  cumulative  regret    (    )   nally selected beam, which is much faster compared to the
          performance  of  the  proposed  HBA  algorithm  in  LOS  and   Exp3 algorithm,  requiring nearly 100 time slots  to con‑
          NLOS  scenarios,  respectively,  where  the  curves  have been   verge [9]. Furthermore, the convergence speed is related
          averaged  over  the  receiver  locations.  Here,  SNR is  de ined   to the SNR, i.e., the HBA needs more time slots to converge
          as  the  transmit  power  of  one  antenna divided  by the noise   to  the  optimal  beam  under  low  SNR,  suggested  also  by
          power  at  one  receive  antenna.  First,  a  bounded  regret   Fig.  9.  The reason is that in low SNR, the measured re‑
          behavior  is  observed  for  both  LOS  and  NLOS  scenarios,   wards are severely affected by the noise and the HBA re‑
          which  complies  with  the  conclusion from [9]. In addition,   quires more feedback information from the receiver to de‑
          the cumulative regret and the noise power are positively   termine the mean reward. In all SNR conditions, our pro‑
          correlated, which indicates that the HBA  needs  more  time   posed approach converges faster compared to the bench‑
          slots  to  converge  to  the  optimal  beam  under  low  SNR.   mark schemes.
          However,  under  all  SNR  conditions,  the  HBA  can  achieve
          nearly  100%  beam  accuracy  after  40  time  slots,  as   5.3  BER performance
          con irmed  by  the  bounded  regret  behavior.  Moreover,
          under all SNR conditions in both LOS and  NLOS  scenarios,   In  Fig.  10,  the  BER  performance  of  the  different  beam‑
          HBA  performs  better  than  Exp3 with  respect  to  cumulative   forming  schemes  is  shown  for  the  LOS  scenario.  Appa-
          regrets.  This  is  due  to  the fact  that  HBA  utilizes  the   rently,  the  performance  of  the  HBA  is  signi icantly  better
          hierarchical  structure  of  the  codebook.  Thus,  searching  than  that  of  random  beamforming.  Furthermore,  the  HBA





                                             © International Telecommunication Union, 2021                    75
   82   83   84   85   86   87   88   89   90   91   92