Recently, the Internet of Things (loT) has attracted more and more attention. Multimedia sensor network plays an important role in the IoT, and audio event detection in the multimedia sensor net- works is one of the...Recently, the Internet of Things (loT) has attracted more and more attention. Multimedia sensor network plays an important role in the IoT, and audio event detection in the multimedia sensor net- works is one of the most important applications for the Internet of Things. In practice, it is hard to get enough real-world samples to generate the classifi- ers for some special audio events (e.g., car-crash- ing in the smart traffic system). In this paper, we introduce a TrAdaBoost-based method to solve the above problem. By using the proposed approach, we can train a strong classifier by using only a tiny amount of real-world data and a large number of more easily collected samples (e.g., collected from TV programs), even when the real-world data is not sufficient to train a model alone. We deploy this ap- proach in a smart traffic system to evaluate its per- formance, and the experiment evaluations demonstrate that our method can achieve satisfying results.展开更多
Coexistence of satellite and terrestrial wireless communication systems in the same frequency band is quite promising for addressing the challenge of spectrum scarcity. To cope with the inevitable inter-system interfe...Coexistence of satellite and terrestrial wireless communication systems in the same frequency band is quite promising for addressing the challenge of spectrum scarcity. To cope with the inevitable inter-system interference, radio resource allocation at both sides should be carefully re-optimized. In this paper, we focus on a scenario where a satellite communication system and a terrestrial distributed antenna system(DAS) coexist via spectrum sharing. We particularly utilize the radio map(RM) to reduce the system overhead for channel acquisition. Based on the large-scale channel state information at the transmitter(CSIT), which is derived from the RM, we propose an optimized power allocation scheme to improve the achievable sum rate of the terrestrial system. For the satellite side, an opportunistic user scheduling scheme is presented, to reduce the harmful leakage interference to the terrestrial mobile users. Simulation results demonstrate that the proposed RM-based coordination scheme can significantly promote the performance of satellite terrestrial coexistence, although the small-scale channel fading has been ignored in the formulated optimization.展开更多
Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.Th...Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.The multichannel audio is represented with 3-order tensor space and is decomposed into core tensor with three factor matrices in the way of channel,time and frequency.Only the truncated core tensor is transmitted which will be multiplied by the pre-trained factor matrices to reconstruct the original tensor space.Objective and subjective experiments have been done to show a very noticeable compression capability with an acceptable output quality.The novelty of the proposed compression method is that it enables both high compression capability and backward compatibility with limited signal distortion to the hearing.展开更多
High efficiency audio compression is the basic technology in audio involved multimedia communications. Downmixing and parametric coding is efficient coding scheme with wide applications in some up-to-date audio codecs...High efficiency audio compression is the basic technology in audio involved multimedia communications. Downmixing and parametric coding is efficient coding scheme with wide applications in some up-to-date audio codecs such as Parametric Stereo (PS) in EAAC+ and MPEG-Surround. Principle Component Analysis (PCA) stereo coding followed this idea to map two channels to one channel with maximum energy and parameterize the secondary channel. This paper investigates the conventional PCA method performance under general stereo model with multiple sound sources and different directions, and then proposes a Polar Coordinate based PCA (PC-PCA) stereo coding method. It has been proved that when multiple sound sources exist with different directions, PC-PCA is better than the conventional PCA method when Mean to Standard deviation Ratio (MSR) is large. A stereo codec based on PC-PCA is proposed to validate the performance improvement of proposed method. Objective and subjective tests show the proposed method achieves a comparative quality and saves 50% parameter bit rate comparing with conventional PCA method, and obtains a 4-8 MUSHRA scores improvement comparing with state-of-the-art stereo codec at the same parameter bit rate.展开更多
基金supported by the National Natural Science Foundation of China(No.60821001)the National Basic Research Program of China(No.2007CB311203)
文摘Recently, the Internet of Things (loT) has attracted more and more attention. Multimedia sensor network plays an important role in the IoT, and audio event detection in the multimedia sensor net- works is one of the most important applications for the Internet of Things. In practice, it is hard to get enough real-world samples to generate the classifi- ers for some special audio events (e.g., car-crash- ing in the smart traffic system). In this paper, we introduce a TrAdaBoost-based method to solve the above problem. By using the proposed approach, we can train a strong classifier by using only a tiny amount of real-world data and a large number of more easily collected samples (e.g., collected from TV programs), even when the real-world data is not sufficient to train a model alone. We deploy this ap- proach in a smart traffic system to evaluate its per- formance, and the experiment evaluations demonstrate that our method can achieve satisfying results.
基金supported in part by the National Science Foundation of China under grant No.61701457
文摘Coexistence of satellite and terrestrial wireless communication systems in the same frequency band is quite promising for addressing the challenge of spectrum scarcity. To cope with the inevitable inter-system interference, radio resource allocation at both sides should be carefully re-optimized. In this paper, we focus on a scenario where a satellite communication system and a terrestrial distributed antenna system(DAS) coexist via spectrum sharing. We particularly utilize the radio map(RM) to reduce the system overhead for channel acquisition. Based on the large-scale channel state information at the transmitter(CSIT), which is derived from the RM, we propose an optimized power allocation scheme to improve the achievable sum rate of the terrestrial system. For the satellite side, an opportunistic user scheduling scheme is presented, to reduce the harmful leakage interference to the terrestrial mobile users. Simulation results demonstrate that the proposed RM-based coordination scheme can significantly promote the performance of satellite terrestrial coexistence, although the small-scale channel fading has been ignored in the formulated optimization.
基金This work was partially supported by the National Natural Science Foundation of China under Grants No.11161140319,No.61001188,the Specialized Research Fund for the Doctoral Program of Higher Education under Grant No.20101101110020,the Fund for Basic Research from Beijing Institute of Technology under Grant No.20120542011,the Fund for Beijing Higher Education Young Elite Teacher Project under Grant No.YETP1202
文摘Multichannel audio signal is more difficult to be compressed than mono and stereo ones.A novel multichannel audio signal compression method based on tensor representation and decomposition is proposed in this paper.The multichannel audio is represented with 3-order tensor space and is decomposed into core tensor with three factor matrices in the way of channel,time and frequency.Only the truncated core tensor is transmitted which will be multiplied by the pre-trained factor matrices to reconstruct the original tensor space.Objective and subjective experiments have been done to show a very noticeable compression capability with an acceptable output quality.The novelty of the proposed compression method is that it enables both high compression capability and backward compatibility with limited signal distortion to the hearing.
基金supported by National Natural Science Foundation of China under Grants No. 61231015, No. 61102127 No. 61201340, No. 61201169Major National Science and Technology Special Projects under Grant No. 2010ZX03004-003-03+2 种基金Natural Science Foundation of Hubei Province under Grant No. 2011CDB451Wuhan ChenGuang Science and Technology Plan under Grant No. 201150431104the Fundamental Research Funds for the Central Universities
文摘High efficiency audio compression is the basic technology in audio involved multimedia communications. Downmixing and parametric coding is efficient coding scheme with wide applications in some up-to-date audio codecs such as Parametric Stereo (PS) in EAAC+ and MPEG-Surround. Principle Component Analysis (PCA) stereo coding followed this idea to map two channels to one channel with maximum energy and parameterize the secondary channel. This paper investigates the conventional PCA method performance under general stereo model with multiple sound sources and different directions, and then proposes a Polar Coordinate based PCA (PC-PCA) stereo coding method. It has been proved that when multiple sound sources exist with different directions, PC-PCA is better than the conventional PCA method when Mean to Standard deviation Ratio (MSR) is large. A stereo codec based on PC-PCA is proposed to validate the performance improvement of proposed method. Objective and subjective tests show the proposed method achieves a comparative quality and saves 50% parameter bit rate comparing with conventional PCA method, and obtains a 4-8 MUSHRA scores improvement comparing with state-of-the-art stereo codec at the same parameter bit rate.