통합 검색 | Korea Science

수퍼스칼라 마이크로프로세서용 부동 소수점 연산회로의 설계 (A design of floating-point arithmetic unit for superscalar microprocessor)

최병윤;손승일;이문기
- 한국통신학회논문지
- /
- 제21권5호
- /
- pp.1345-1359
- /
- 1996
This paper presents a floating point arithmetic unit (FPAU) for supescalar microprocessor that executes fifteen operations such as addition, subtraction, data format converting, and compare operation using two pipelined arithmetic paths and new rounding and normalization scheme. By using two pipelined arithmetic paths, each aritchmetic operation can be assigned into appropriate arithmetic path which high speed operation is possible. The proposed normalization an rouding scheme enables the FPAU to execute roundig operation in parallel with normalization and to reduce timing delay of post-normalization. And by predicting leading one position of results using input operands, leading one detection(LOD) operation to normalize results in the conventional arithmetic unit can be eliminated. Because the FPAU can execuate fifteen single-precision or double-precision floating-point arithmetic operations through three-stage pipelined datapath and support IEEE standard 754, it has appropriate structure which can be ingegrated into superscalar microprocessor.
PDF

IEEE 754-1985 단정도 부동 소수점 연산용 나눗셈기 설계 (Design of a Floating-Point Divider for IEEE 754-1985 Single-Precision Operations)

박안수;정태상
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 2001년도 합동 추계학술대회 논문집 정보 및 제어부문
- /
- pp.165-168
- /
- 2001
This paper presents a design of a divide unit supporting IEEE-754 floating point standard single-precision with 32-bit word length. Its functions have been verified with ALTERA MAX PLUS II tool. For a high-speed division operation, the radix-4 non-restoring algorithm has been applied and CLA(carry-look -ahead) adders has been used in order to improve the area efficiency and the speed of performance for the fraction division part. The prevention of the speed decrement of operations due to clocking has been achieved by taking advantage of combinational logic. A quotient select block which is very complicated and significant in the high-radix part was designed by using P-D plot in order to select the fast and accurate quotient. Also, we designed all division steps with Gate-level which visualize the operations and delay time.
PDF

IEEE 반올림과 덧셈을 동시에 수행하는 부동 소수점 곱셈 연산기 설계 (Design of the floating point multiplier performing IEEE rounding and addition in parallel)

박우찬;정철호
- 전자공학회논문지C
- /
- 제34C권11호
- /
- pp.47-55
- /
- 1997
In general, processing flow of the conventional floating-point multiplication consists of either multiplication, addition, normalization, and rounding stage of the conventional floating-point multiplier requries a high speed adder for increment, increasing the overall execution time and occuping a large amount of chip area. A floating-point multiplier performing addition and IEEE rounding in parallel is designed by using the carry select addder used in the addition stage and optimizing the operational flow based on the charcteristics of floating point multiplication operation. A hardware model for the floating point multiplier is proposed and its operational model is algebraically analyzed in this paper. The proposed floating point multiplier does not require and additional execution time nor any high spped adder for rounding operation. Thus, performance improvement and cost-effective design can be achieved by this suggested approach.
PDF

내장형 프로세서를 위한 IEEE-754 고성능 부동소수점 나눗셈기의 설계 (IEEE-754 Floating-Point Divider for Embedded Processors)

정재원;홍인표;정우경;이용석
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 추계종합학술대회 논문집(2)
- /
- pp.353-356
- /
- 2000
In this paper, a high-performance and small-area floating-point divider, which is suitable for embedded processors and supports all rounding modes defined by IEEE 754 standard, is designed using the series expansion algorithm. This divider shares and fully utilizes the two MAC units for quadratical convergence to the correct quotient. The area increase of two MAC units due to the division is minimized in this design, so that it can be suitable for embedded processors. The tested HDL codes are synthesized and optimized with 0.35$\mu\textrm{m}$ CMOS standard celt libraries. The results show that the latency of the synthesized divider is 17.43 ㎱ in worst condition. But, the divider calculates the correct rounded quotient through only 6 cycles.
PDF

TMS320C32 DSP를 이용한 실시간 화자종속 음성인식 하드뒈어 모듈 구현 (Real-time implementation of speaker dependent speech recognition hardware module using the TMS320C32 DSP)

정훈;정익주
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1998년도 제15회 음성통신 및 신호처리 워크샵(KSCSP 98 15권1호)
- /
- pp.14-22
- /
- 1998
본 연구에서는 Texas instruments 사의 저가형 부동소수점 디지털 신호 처리기인 TMS320C32를 이용하여 실시간 화자종속 음성인식 하드웨어 모듈을 개발하였다. 하드웨어 모듈의구성은 40MHz 의 TMS320C32, 14bit 코덱인 TLC32044, EPROM 과 SRAM 등의 메모리와 호스트 인터페이스를 위한 로직회로로 이루어져 있다. 뿐만 아니라 이 하드웨어 모듈을 PC 상에서 평가해보기 위한 PC 인터페이스용 보드 및 소프트웨어도 개발하였다. 음성인식 알고리즘은 C 및 어셈블리를 이용한 최적화를 통하여 계산속도를 대폭 개선하였다. 현재 인식률은 일반 사무실 환경에서 30단어에 대하여 95% 이상으로 매우 높은 편이며, 특히 배경음악이나 자동차 소음과 같은 잡음환경에서도 잘 동작한다.
PDF

SUB-BAND 적응 디지털 필터 실시간 시스템 구현에 관한 연구 (A Study on Real Time Implementation of an Adaptive Digital Filter Using a Sub-band Structure)

류차희;윤대희;유재하;차일환
- 한국음향학회지
- /
- 제12권6호
- /
- pp.13-20
- /
- 1993
충격 응답 시간이 긴 시스템을 모델링하기 위한 실시간 적응 디지털 필터를 구현하였다. 대상 시스템의 충격 응답 시간이 길 때, 일반적인 적응 디지털 필터를 사용하는 경우 발생하는 수렴 속도 저하와 계산량 증가 문제를 해결하기 위해서 서브밴드 구조를 갖는 적응 디지털 필터를 구성하였다. 실시간 처리 시스템에서는 GQMF을 사용하여 입력 신호를 4개 대역으로 분할하여 각 대역별로 적응 필터링을 수행함으로써 수렴 속도를 향상시킨다. 또한 대역별 신호를 동시에 분산 처리하기 때문에 계산량 면에서 효율적이므로 시스템의 충격 응답이 긴 경우에는 실시간 처리가 가능하다. 하드웨어 구성은 범용 신호 처리 프로세서인 DSP56001을 호스트 프로세서로 사용하며, 적응 디지털 필터 칩 DSP56200을 사용하여 각 대역 적응 필터를 구성하였다. 실험은 충격 응답 시간이 16 kHz 필터링 시 2000 탭 길이로 가정된 시스템을 대상으로 부동 소수점 시뮬레이션 결과와 실시간 처리 시스템의 결과를 비교하였다. 밴드를 나누지 않은 기존의 방법과 서브밴드 시스템의 비교 실험 결과 입력이 백색 잡음인 경우 대역별 간섭에 의한 성능 저하가 있었으나, 음성과 유사한 특성을 갖는 유색 잡음인 경우 서브밴드 시스템이 단일 시스템에 비해 성능 향상을 보였다.
PDF

IEEE 부동 소수점 덧셈/뺄셈 연산에서 효율적인 반올림 알고리즘과 구현 (Efficient Rounding Algorithm and Implementation for IEEE Floating Point Addition/Subtraction)

김병화;안현식;김도현
- 전자공학회논문지B
- /
- 제32B권3호
- /
- pp.24-30
- /
- 1995
The process of conventional floating-point additio $n_traction operation consists of alignment, additio $n_traction, normalization, and rounding stage. Because rounding stage needs an incrementor or adder, it occupies much time and chip area. In addition, it needs additional time and hardware for renormalization which occurs in overflow due to rounding In this paper, floating-point adde $r_tractor performing rounding and additio $n_traction in parallel is presented by using the feature of additio $n_traction and carry select adder used in additio $n_tracting stage. Proposed floating point adde $r_tractor doesn't need time and incrementor nor adder for rounding. Also, renormalization doesn't occur since rounding is performed prior to normalization.to normalization.
PDF

MATLAB을 이용한 부동소수점 연산의 특이사항 분석 (Analysis of Some Strange Behaviors of Floating Point Arithmetic using MATLAB Programs)

정태상
- 전기학회논문지
- /
- 제56권2호
- /
- pp.428-431
- /
- 2007
A floating-point number system is used to represent a wide range of real numbers using finite number of bits. The standard the IEEE adopted in 1987 divides the range of real numbers into intervals of [$2^E,2^{E+1}$), where E is an Integer represented with finite bits, and defines equally spaced equal counts of discrete numbers in each interval. Since the numbers are defined discretely, not only the number representation itself includes errors but in floating-point arithmetic some strange behaviors are observed which cannot be agreed with the real world arithmetic. In this paper errors with floating-point number representation, those with arithmetic operations, and those due to order of arithmetic operations are analyzed theoretically with help of and verification with the results of some MATLAB program executions.
PDF KSCI

16-bit CPU용 C 컴파일러 개발 (Development of C Compiler for 16-bit CPU)

정삼진
- 한국산학기술학회:학술대회논문집
- /
- 한국산학기술학회 2009년도 춘계학술발표논문집
- /
- pp.439-442
- /
- 2009
본 연구는 16 비트 CPU를 위한 새로운 C 컴파일러를 개발하고자 한다. 새로운 ASIC 프로세서가 특정 용도로 설계되었을 때 그 CPU를 위한 새로운 컴파일러의 개발이 필요하다. 공개 소프트웨어인 GNU C 컴파일러를 사용하여 기계 의존 원시 파일들을 수정함으로서 새로운 컴파일러를 개발할 수 있다. 개발된 컴파일러는 단지 기계어에 의해 처리될 수 있는 기능들만 지원할 수 있기 때문에 곱 셈, 나눗셈, 부동소수점 처리등과 같은 기능들을 지원하기 위해서는 더 많은 연구가 필요하다. 완전한 컴파일러가 개발된 후에는 새로운 CPU에서 실행될 수 있는 응용 프로그램의 개발이 필요하다. 본 연구에 의해서 앞으로 개발될 여러 가지 다른 용도의 CPU를 위한 컴파일러들이 쉽게 개발될 수 있을 것이다.
PDF

선택적 보간기법을 사용한 개선된 칼라영상 보간 알고리즘 (Improved algorithm of color image interpolation by using Selective interpolation)

이성목;김주현;박정환;곽부동;강봉순
- 융합신호처리학회 학술대회논문집
- /
- 한국신호처리시스템학회 2006년도 하계 학술대회 논문집
- /
- pp.33-36
- /
- 2006
본 논문은 에지부분에서 뚜렷한 영상의 복원이 가능하도록 하는 칼라 영상의 비선형 보간 기법에 관한 것이다. 일반적으로 칼라영상을 구성하고 있는 성분중 휘도신호(Y)가 에지(edge)성분에 충실한 정보를 갖고 있다는 점에 착안하여 알고리즘 연구를 수행하였다. 일반적인 선형 보간 방법을 사용할 시 영상에서 고주파 대역의 손실을 일으키므로 영상의 화질 열화가 발생한다. 이를 보완하기 위해 본 논문에서는 비선형 보간법인 에지 방향성 보간 방법을 제안하였다 또한 조밀한 에지 영역에서의 에지 방향성 보간의 단점을 극복하기 위해 선형 보간과 에지방향성 보간 기법의 혼합을 통한 화질 열화 제거 기법을 제안한다.
PDF

검색결과 245건 처리시간 0.019초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)