• 제목/요약/키워드: 텐서

Search Result 378, Processing Time 0.026 seconds

Analyzing the performance of training tasks based on GPU memory use manner of TensorFlow in Container environments (컨테이너 환경에서 텐서플로의 GPU 메모리 사용방식에 따른 학습 작업의 성능 분석)

  • Jihun Kang;Joon-Min Gil
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.60-62
    • /
    • 2023
  • 인공지능의 학습 작업은 연산량이 많아 고성능 연산 장치인 GPU(Graphics Processing Unit)를 필요로 하며, GPU 장치의 성능은 학습 작업의 실행 성능에 직접적으로 영향을 미치는 요소 중 하나로 작용한다. 인공지능 작업을 처리하기 위해 많이 사용되는 텐서플로의 경우 GPU를 사용해 연산을 수행할 때 기본적으로 거의 모든 GPU 메모리 영역을 단일 학습 작업이 점유하도록 GPU 메모리를 관리한다. 이 방법은 컴퓨팅 자원 중 확장성이 가장 낮은 GPU 메모리의 단편화를 방지하기 위해 사용되는 방법이지만, 하나의 학습 작업이 GPU를 점유하게 되면, 실제 GPU 메모리 사용량과 상관없이 다른 프로세스는 GPU를 사용할 수 없는 문제를 유발한다. 특히, 전이학습, 소규모 학습과 같이 상대적으로 작업 규모가 작은 경우에는 전체 GPU 메모리 용량 중 대부분의 영역이 낭비된다. 본 논문에서는 컨테이너 환경에서 텐서플로의 기본 GPU 메모리 사용 방식으로 인해 다수의 학습 작업을 동시 실행하는 것이 불가능한 문제를 확인하고 GPU 메모리 사용량을 제한한 경우와 하지 않은 경우에 실제 GPU 메모리 사용량과 학습 작업의 실행 시간에 대한 성능 비교를 통해 GPU 메모리의 단편화 방지가 성능에 유의미한 요소인지 검증한다.

Image Reconstruction of Eigenvalue of Diffusion Principal Axis Using Diffusion Tensor Imaging (확산텐서영상을 이용한 확산 주축의 고유치 영상 재구성)

  • Kim, In-Seong;Kim, Joo-Hyun;Yeon, Gun;Suh, Kyung-Jin;Yoo, Don-Sik;Kang, Duk-Sik;Bae, Sung-Jin;Chang, Yong-Min
    • Investigative Magnetic Resonance Imaging
    • /
    • v.11 no.2
    • /
    • pp.110-118
    • /
    • 2007
  • Purpose: The objective of this work to construct eigenvalue maps that have information of magnitude of three primary diffusion directions using diffusion tensor images. Materials and Methods: To construct eigenvalue maps, we used a 3.0T MRI scanner. We also compared the Moore-Penrose pseudo-inverse matrix method and the SVD (single value decomposition) method to calculate magnitude of three primary diffusion directions. Eigenvalue maps were constructed by calculating of magnitude of three primary diffusion directions. We did investigate the relationship between eigenvalue maps and fractional anisotropy map. Results: Using Diffusion Tensor Images by diffusion tensor imaging sequence, we did construct eigenvalue maps of three primary diffusion directions. Comparison between eigenvalue maps and Fractional Anisotropy map shows what is difference of Fractional Anisotropy value in brain anatomy. Furthermore, through the simulation of variable eigenvalues, we confirmed changes of Fractional Anisotropy values by variable eigenvalues. And Fractional anisotropy was not determined by magnitude of each primary diffusion direction, but it was determined by combination of each primary diffusion direction. Conclusion: By construction of eigenvalue maps, we can confirm what is the reason of fractional anisotropy variation by measurement the magnitude of three primary diffusion directions on lesion of brain white matter, using eigenvalue maps and fractional anisotropy map.

  • PDF

Program Development to Evaluate Permeability Tensor of Fractured Media Using Borehole Televiewer and BIPS Images and an Assessment of Feasibility of the Program on Field Sites (시추공 텔리뷰어 및 BIPS의 영상자료 해석을 통한 파쇄매질의 투수율텐서 계산 프로그램 개발 및 현장 적용성 평가)

  • 구민호;이동우;원경식
    • The Journal of Engineering Geology
    • /
    • v.9 no.3
    • /
    • pp.187-206
    • /
    • 1999
  • A computer program to numerically predict the permeability tensor of fractured rocks is developed using information on discontinuities which Borehole Televiewer and Borehole Image Processing System (BIPS) provide. It uses orientation and thickness of a large number of discontinuities as input data, and calculates relative values of the 9 elements consisting of the permeability tensor by the formulation based on the EPM model, which regards a fractured rock as a homogeneous, anisotropic porous medium. In order to assess feasibility of the program on field sites, the numerically calculated tensor was obtained using BIPS logs and compared to the results of pumping test conducted in the boreholes of the study area. The degree of horizontal anisotropy and the direction of maximum horizontal permeability are 2.8 and $N77^{\circ}CE$, respectively, determined from the pumping test data, while 3.0 and $N63^{\circ}CE$ from the numerical analysis by the developed program. Disagreement between two analyses, especially for the principal direction of anisotropy, seems to be caused by problems in analyzing the pumping test data, in applicability of the EPM model and the cubic law, and in simplified relationship between the crack size and aperture. Aside from these problems, consideration of hydraulic parameters characterizing roughness of cracks and infilling materials seems to be required to improve feasibility of the proposed program. Three-dimensional assessment of its feasibility on field sites can be accomplished by conducting a series of cross-hole packer tests consisting of an injecting well and a monitoring well at close distance.

  • PDF

An Acoustic Event Detection Method in Tunnels Using Non-negative Tensor Factorization and Hidden Markov Model (비음수 텐서 분해와 은닉 마코프 모델을 이용한 터널 환경에서의 음향 사고 검지 방법)

  • Kim, Nam Kyun;Jeon, Kwang Myung;Kim, Hong Kook
    • Asia-pacific Journal of Multimedia Services Convergent with Art, Humanities, and Sociology
    • /
    • v.8 no.9
    • /
    • pp.265-273
    • /
    • 2018
  • In this paper, we propose an acoustic event detection method in tunnels using non-negative tensor factorization (NTF) and hidden Markov model (HMM) applied to multi-channel audio signals. Incidents in tunnel are inherent to the system and occur unavoidably with known probability. Incidents can easily happen minor accidents and extend right through to major disaster. Most incident detection systems deploy visual incident detection (VID) systems that often cause false alarms due to various constraints such as night obstacles and a limit of viewing angle. To this end, the proposed method first tries to separate and detect every acoustic event, which is assumed to be an in-tunnel incident, from noisy acoustic signals by using an NTF technique. Then, maximum likelihood estimation using Gaussian mixture model (GMM)-HMMs is carried out to verify whether or not each detected event is an actual incident. Performance evaluation shows that the proposed method operates in real time and achieves high detection accuracy under simulated tunnel conditions.

Comparison of Deep Learning Frameworks: About Theano, Tensorflow, and Cognitive Toolkit (딥러닝 프레임워크의 비교: 티아노, 텐서플로, CNTK를 중심으로)

  • Chung, Yeojin;Ahn, SungMahn;Yang, Jiheon;Lee, Jaejoon
    • Journal of Intelligence and Information Systems
    • /
    • v.23 no.2
    • /
    • pp.1-17
    • /
    • 2017
  • The deep learning framework is software designed to help develop deep learning models. Some of its important functions include "automatic differentiation" and "utilization of GPU". The list of popular deep learning framework includes Caffe (BVLC) and Theano (University of Montreal). And recently, Microsoft's deep learning framework, Microsoft Cognitive Toolkit, was released as open-source license, following Google's Tensorflow a year earlier. The early deep learning frameworks have been developed mainly for research at universities. Beginning with the inception of Tensorflow, however, it seems that companies such as Microsoft and Facebook have started to join the competition of framework development. Given the trend, Google and other companies are expected to continue investing in the deep learning framework to bring forward the initiative in the artificial intelligence business. From this point of view, we think it is a good time to compare some of deep learning frameworks. So we compare three deep learning frameworks which can be used as a Python library. Those are Google's Tensorflow, Microsoft's CNTK, and Theano which is sort of a predecessor of the preceding two. The most common and important function of deep learning frameworks is the ability to perform automatic differentiation. Basically all the mathematical expressions of deep learning models can be represented as computational graphs, which consist of nodes and edges. Partial derivatives on each edge of a computational graph can then be obtained. With the partial derivatives, we can let software compute differentiation of any node with respect to any variable by utilizing chain rule of Calculus. First of all, the convenience of coding is in the order of CNTK, Tensorflow, and Theano. The criterion is simply based on the lengths of the codes and the learning curve and the ease of coding are not the main concern. According to the criteria, Theano was the most difficult to implement with, and CNTK and Tensorflow were somewhat easier. With Tensorflow, we need to define weight variables and biases explicitly. The reason that CNTK and Tensorflow are easier to implement with is that those frameworks provide us with more abstraction than Theano. We, however, need to mention that low-level coding is not always bad. It gives us flexibility of coding. With the low-level coding such as in Theano, we can implement and test any new deep learning models or any new search methods that we can think of. The assessment of the execution speed of each framework is that there is not meaningful difference. According to the experiment, execution speeds of Theano and Tensorflow are very similar, although the experiment was limited to a CNN model. In the case of CNTK, the experimental environment was not maintained as the same. The code written in CNTK has to be run in PC environment without GPU where codes execute as much as 50 times slower than with GPU. But we concluded that the difference of execution speed was within the range of variation caused by the different hardware setup. In this study, we compared three types of deep learning framework: Theano, Tensorflow, and CNTK. According to Wikipedia, there are 12 available deep learning frameworks. And 15 different attributes differentiate each framework. Some of the important attributes would include interface language (Python, C ++, Java, etc.) and the availability of libraries on various deep learning models such as CNN, RNN, DBN, and etc. And if a user implements a large scale deep learning model, it will also be important to support multiple GPU or multiple servers. Also, if you are learning the deep learning model, it would also be important if there are enough examples and references.

Analysis of transmissivity tensor in an anisotropic aquifer (이방성 대수층에서의 투수량계수텐서 해석)

  • 강철희;이대하;김구영;이철우;김용제;우남칠
    • Journal of Soil and Groundwater Environment
    • /
    • v.7 no.2
    • /
    • pp.53-61
    • /
    • 2002
  • An Aquifer test was carried out on five boreholes to determine the hydrologic anisotropy and the major groundwater flow direction in the aquifer system of the study area. With an assumption of the aquifer's anisotropy and homogeneity, the major transmissivity(T(equation omitted)), the minor transmissivity( $T_{ηη}$ ), and primary tensor direction ($\theta$) for each borehole were determined from the test. Besides the boreholes BH-1, BH-4 and BH-5, the anisotropy transmissivity tensor values of BH-2 and BH-3 did not correspond with the assumption. Thereafter the values were plotted on the polar coordinate, and showed that the tensor values were out of the anisotropy ellipsoid due to the high heterogeneity of BH-2 and BH-3 comparing with the other boreholes. Therefore. the anisotropy of the aquifer was examined from BH-1, BH-4. and BH-5. In BH-1, T(equation omitted) is 171.9 $\m^2$/day. $T_{ηη}$ is $71.01\m^2$/day, and the principal tensor direction is Nl5.39$^{\circ}$E. In BH-4. T(equation omitted) is $268.2 \m^2$/day, $T_{ηη}$ / is $28.75\m^2$/day and the principal tensor direction is N7.55$^{\circ}$E. In BH-5, T(equation omitted) is $168.4\m^2$/day, $T_{ηη}$ is 66.80 $\m^2$/day, and the principal tensor direction is $N76.59^{\circ}$E. On the basis of teleview logging performed on each borehole. the principal fracture directions were revealed as $N0^{\circ}$~4$^{\circ}$E/$30^{\circ}$~$50^{\circ}$SE and $N30^{\circ}$~$80^{\circ}$W/$20^{\circ}$~$50^{\circ}$NE that are the most frequently occurred sets as well as that correspond well with the calculated transmissivity tensor.