Page 84 - ITU Journal Future and evolving technologies Volume 2 (2021), Issue 7 – Terahertz communications
P. 84

ITU Journal on Future and Evolving Technologies, Volume 2 (2021), Issue 7




          In  this  work,  we  mainly  focus  on  the  beam  alignment   to be aligned with an optimal beam. Thus, the BA problem
          problem  for  SU‑MIMO  transmission.  Nevertheless,  the   is transformed to    parallel BA problems, where    is the
                                                                                 
                                                                                                            
          proposed beam alignment framework can be adjusted to   number of sub‑bands. For the   th sub‑band, we interpret
          the  multi‑user  case  in  a  straightforward  manner  due  to   its receive power as the measured reward    , i.e.,
                                                                                                      
          the sub‑band‑wise beam alignment setting. For the multi‑
                                                                                     0 (  )+          −1
          user  scenario,  different  users  are  assigned  to  different
                                                                                                2
                                                                                =    ∑      ‖r[  ]‖ ,       (31)
          sub‑bands such that the sub‑band‑wise beam alignment                                  2
                                                                                      =   0 (  )
          problem is reformulated to a multi‑user beam alignment
          problem.  This  idea  is  similar  to  the  frequency  division
                                                               where  r[  ]  is  the  receive  signal  vector  for  the    th  sub‑
          multiple access in conventional communication systems.
                                                               carrier.  Consider the beam codes in the hiearchical DFT
          The only change here is the determination of the number
                                                               codebook as the arms in an MAB framework.  Due to the
          of sub‑bands, which should be chosen based on both the
                                                               randomness of data and noise, the reward    is a random
                                                                                                      
          number of users and the number of transmit antennas.
                                                               variable.  This indicates that our LOS BA problem can be
          4.1.2   NLOS beam alignment                          classi ied as a stochastic MAB problem.  Assuming a sta‑
                                                               tionary indoor scenario, we consider the channel as con‑
          In  the  NLOS  scenario,  the  receive  signal  consists  of  se-  stant  during  the  BA  process.  For simplicity,  the  reward
                                                                                                              2
                                                               is modeled as Gaussian distributed with variance        .
          veral  NLOS  MPCs.  To  utilize  the  power  of  the  NLOS                                             
                                                               The average payoff function   [   (w)] for a beam code w
          com‑  ponents,  the  hierarchical  k‑means  codebook                                                   
          described in Section  3  is  implemented  in  this  scenario.   in  the  hierarchical  DFT  codebook  is  equivalent  to  the
          Due  to  the high  frequency  selectivity  of  the  considered   average receive power    (w),
                                                                                      
                                                                                         
          THz  NLOS  channel,  even  the  frequency  response  of
                                                                                 [   (w )] =    (w ),       (32)
          two  neighbor‑  ing  subcarrier  may  be  quite  distinct.                             
          Thus, the sub‑band beam alignment as used for the LOS
                                                               In the time slot   , the transmitter selects a beam code
          scenario  is  not  con‑ sidered in the NLOS case.  Instead,
                                                               w (  ) from the hierarchical DFT codebook W         for the
                                                                   
          speci ic beams need to be aligned to each subcarrier used     th sub‑band. At the receiver, the power    (w (  )) of the
          in an SC‑FDMA system for a fully optimum performance.                                          
                                                                 th sub‑band is measured and fed back to the transmit‑
          The  BA  problem  for  the    t h  subcarrier  is  equivalent  to
                               ∗                               ter. At the end of time slot   , the transmitter obtains the
           inding the beam code w from the hierarchical k‑means   measured rewards of all sub‑bands for this time slot and
                                 
          codebook with maximal mean receive power for the   t h
                                                               decides which arm to select for which sub‑band based on
          subcarrier. Here, the mean receive power    (w ) for pre‑   a speci ic rule for the next time slot    + 1.
                                                
                                                   
          coding vector w and the   t h subcarrier is given by  For a stochastic MAB problem, the performance of the al‑
                         
                                                2
                                 
                              
                     (w ) = w H [  ]H[  ]w +       .  (30)     gorithm is evaluated via the expected cumulative regrets
                        
                     
                                          
                              
                                                  
                                                               over    time slots. Here, the expected cumulative regrets
                                                               for the   th sub‑band    (  ) is de ined as the expected dif‑
          Similar to the extension to multi‑user transmission in the                
                                                               ference between the cumulative reward of the optimal
          LOS  case,  the  beam  alignment  framework  for  the  NLOS
                                                                     ∗
                                                               arm w and the cumulative reward of the proposed algo‑
          scenario can be adjusted to a multi‑user transmission as     
                                                               rithm for   th sub‑band, given by
          well.  In  the  case  of  multi‑user  transmission,  different
          users are assigned to different subcarriers such that the
                                                                                   
          subcarrier‑wise beam alignment algorithm is modi ied to           (  ) = ∑ (   (w ) −    (w (  ))) .  (33)
                                                                                         ∗
          a multi‑user beam alignment algorithm.                                                    
                                                                                   =1
          4.1.3   HBA problem formulation                      The objective of the design of the MAB algorithm is to  ind
                                                               a selection policy that minimizes the sum expected cumu‑
          To accelerate the BA process in the large‑scale MIMO case,   lative regret           (  ) over all sub‑bands, i.e.,           (  ) =
          we reformulate the BA problem to a stochastic MAB prob‑   ∑           (  ).
                                                                        
                                                                   =1
          lem for stationary environments.  The transmission sys‑   The BA problem for the NLOS scenario is similar to that
          tem is  considered  as  a  time  slotted  system  with     time   for the LOS scenario. Here, the BA problem can be trans‑
          slots to search for the optimal beam. At the begining of the   formed to    parallel BA problem. For the   th subcarrier,
                                                                                              2
          BA  phase,  the  propagation  scenario  can  be  determined   we interpret its receive power ‖r[  ]‖ as the measured re‑
                                                                                              2
          at the transmitter side based on the feedback of the re‑   ward    , and consider the beam codes in the hierarchical
                                                                       
          ceive power pro ile. If the propagation environment is an   k‑meanscodebookasthearmsinanMABframework. The
          LOS scenario, the sub‑band BA discussed in the LOS beam   reward is modeled as Gaussian distributed with variance
                                                                 2
          alignment  is  adopted  for  the  following  BA  procedure.      . The average payoff function   [   (w )] for beam code
                                                                                               
                                                                                                   
          Otherwise,  the  subcarrier  BA  introduced  in  the  NLOS   w is equivalent to the average receive power    (w ),
                                                                                                          
                                                                                                             
                                                                   
          beam alignment is utilized.
                                                                                              
                                                                                      2
                                                                                           
                                                                                                             2
          If an LOS component can be received, all     subcarriers     [   (w )] =    (w ) =    w H [  ] H[  ] w +       .
                                                                                                        
                                                                                                                
                                                                                        
                                                                               
                                                                                  
                                                                         
                                                                     
                                                                                           
          will be divided into    sub‑bands and each sub‑band has                                           (34)
                             
          72                                 © International Telecommunication Union, 2021
   79   80   81   82   83   84   85   86   87   88   89