Edition Date: 2019-11-12
File category: Maintenance program
Content of this volume: mainly describe how to troubleshoot various faults of the T17 + hash board, and how to use test fixtures for accurate positioning.
※ The copyright of this article belongs to Bitmaintech Pte.Ltd. (Bitmain). The article shall solely be reprinted, extracted or used in any other ways with the permission of the copyright owner. Please contact Bitmain official customer service if there is any need of reprinting or quoting.
I. Requirements for Maintenance Platform
1. Constant temperature soldering iron (370 - 400 ° C); the head of tip soldering iron is used to solder small patches such as chip resistors and capacitors, etc.
2. Heat gun and BGA rework station are used for chip disassembly and soldering. Do not heat for a long time to avoid PCB foaming.
3. APW9 + power supply (APW9 + _14.5V-21V_V2.01 (470uF450V)) and power patch cord (self-made), used for power supply of the hash board
4. Fluke 15b+ Multimeter, tweezers, 2.1040 Antminer test fixture (conditionally configurable oscilloscope).
5. Flux Solder paste, water for cleaning panel and anhydrous alcohol; water for cleaning panel is used to clean up the residue and appearance of the flux after maintenance.
6. Tin-planting fixture, tin-planting steel mesh, tin-absorbing wire, solder paste; when replacing a new chip, the chip pin shall be planted with tin. BSM surface shall be planted with tin before being soldered to the hash board.
7. Thermal conductive paste, used to apply on the chip / heat sinks after repair.
8. You can also choose the simple and convenient Bitmain antminer hash board repair bundle, which includes the necessary Antminer repair tools. Of course, Antminer T17+ control board is also a very important spare part.
II. Operational requirements
1. Maintenance personnel must have certain electronic knowledge, more than one year of maintenance experience, and be proficient in BGA / QFN / LGA package soldering technology.
2. After repairing, the hash board must be tested to be OK for more than twice, otherwise, it shall be rejected.
3. Please pay attention to the operation method when replacing the chip. There should be no obvious deformation of the PCB board after replacing any accessories. Check whether there are any open or short circuits, or missing parts in the replacement parts and the surroundings.
4. Check and confirm the tools and the test fixtures can work normally, determine the test software parameters for the maintenance station, and version of test fixtures, etc.
5. After passing the repair and replacement chip test, users need to check the full chip before performing the functional test. The functional test shall ensure that the double-sided heat sinks are soldered OK and the cooling fan is at full speed. When using the chassis cooling function, users must put 2 hash boards at the same time to form air duct. The single-sided test of production must also ensure the formation of air ducts (important)
6. When measuring the signal, two fans are used to dissipate heat as assistance measure, and the fans shall maintain full speed.
III. Principles and Structure
1. Principle description:
1) T17 + consists of 44 BM1397 chips and 11 voltage domains; each 4 ICs are a group.
2) The working voltage of the BM1397 chip used by T17 + is 1.55V, and the chip is equipped with LDO to supply VDDIO 1.8V, VDDPLL 0.8V power supply.
3) The T17 + clock is a 25M crystal oscillator, which is transmitted in series from the first chip to the 44th one.
2. T17 + hash board signal direction:
CLK (XIN) signal direction: It is generated by Y1 25M crystal oscillator and transmitted from chip 01 to chip 44. During operation, the voltage is 1.6-1.8V.
TX (CI, CO) signal direction: input from pin 7 (3.3V) of IO port, transferred to IC U2 through level conversion, then transmitted from chip 01 to chip 44; the voltage is 0V when the IO line is not inserted, and the voltage during operation is 1.8V.
RX (RI, RO) signal direction: from chip 44 to chip 01, return to pin 8 of the signal cable terminal via U1 and return to the control board; the voltage is 0.3V when the IO line is not inserted, and the voltage during operation is 1.8V.
BO (BI, BO) signal direction: from chip 01 to chip 44; measure value is 0V when measuring with a multimeter. RST signal direction: input from pin 3 of the IO port, and then transmitted from chip 01 to chip 44; 0V without IO signal or in standby and 1.8V in operation.
3. T17 + hash board key circuit
During maintenance, mainly measure the 10 signal voltages in front of and behind the chip (5 in front of and 5 behind the chip: CLK, CO, RI, BO, RST); CORE voltage; LDO-1.8V, PLL-0.8V; 21 to 23V voltage transformation of booster circuit.
Detection method (each chip has lead-out test points):
Plug in the IO line and press the fixture test key, then the PIC starts to work. At this time, the normal voltage of each test point should be:
RI: 1.6-1.8V. During operation, abnormal or too-low voltage may cause the hash board to be abnormal or the hashrate to be 0.
BO: 0V when no operation is performed, and 0V during operation.
RST: 1.8V. Each time the test button of the fixture is pressed, the reset signal is output again.
When the status and voltage of the test points are abnormal, please estimate the fault point based on the circuit in front of and behind the test point.
Note: The steel windshield of the hash board is 21V. During the measurement and maintenance, please keep the maintenance table clean and insulated to avoid short circuit during the maintenance.
IV. Malfunctions and Poor Performance
1. LDO 1.8V or PLL 0.8V is abnormal. For this bad problem: the single board appears as incomplete chip reading; the entire miner appears as few chips or dropped boards or chip crosses.
The LDO of last three groups is powered by the 23V output of the boost circuit U6, and the other groups are powered by the 21V divided voltage converted by the LDO.
2. The working voltage (core voltage) of chip is abnormal. For this bad problem: the single board appears as abnormal reading of number of chips on the board; the entire miner appears as few chips or dropped boards. Please check PIC soldering and programming.
Check the MOS of the corresponding group for short circuit
hash board PIC Programming
b. Download tools
PICkit3: pin 1 of the PICkit3 cable corresponds to pin 1 of J3 on the PCB, and needs to connect to pins 1, 2, 3, 4, 5, and 6.
c. Programming software
Open MPLAB IPE, select device: PIC16F1704, click power to select the power supply mode, then click operate: 1 select “file” to find the .HEX file to be programmed; 2, click “connect” to connect normally; 3, click the program button; after completion, click verify, then it will prompt that the verification is complete, verifying successful programming.
3. The chip signal pin output is abnormal (BO / RST / CO / RI / CLK). Determine the bad position according to the signal direction.
Please measure the impedance of the chip to ground firstly after power is off (compared to a good board or an adjacent group). If possible, users can use X-RAY to view the soldering effect of the chip.
4. The temperature reading is abnormal. This bad single board appears as sensor NG (the fixture interface can display temp NG, log synchronization test results); the entire miner appears as a temperature reading of 0 °C or no temperature reading.
In case of this kind of problem, please check the temperature-sensitive model and the welding of surrounding parts, and report the soldering of the chip corresponding to abnormal temperature-sensitive condition.
5. Test NG, and find insufficient nonce return of hash board. For this bad problem: the single board appears as insufficient nonce return is tested; please perform soldering inspection according to the bad chips displayed in the log; it is recommended to re-solder the chip or replace the corresponding NG chip after checking that there is no problem with the welding around the chip.
V. Reference steps for hash board maintenance
1. Routine test:
(1) First perform visual inspection on the hash board to be repaired, observe whether the PCB is deformed or burnt. If yes, it must be handled first; check whether there are any parts with obvious burn marks, collision offset or missing parts, etc.
(2) If no problem is found through visual inspection, the impedance of each voltage domain can be tested first to detect whether there is a short circuit or an open circuit. If yes, it must be handled first.
(3) Check whether the voltage of each domain is about 1.6V.
2. After the routine test is OK (in general, the short-circuit test is necessary for the routine test to prevent the chip or other materials from being burned due to short circuit when the power is on), you can use the test fixture to perform chip detection, and determine the positioning based on the test fixture test results.
3. According to the display result of the test fixture test, test the voltages of chip test points (CO / NRST / RO / XIN / BI), VDD0V8 and VDD1V8, etc. starting from the vicinity of the faulty chip.
4. According to the signal direction (the RX signal is passed in the reverse direction (from chip 44 to 1), and several signals CLK CO BO RST are transmitted in the forward direction (from chip 1 to 44), find the abnormal fault point through the power supply sequence.
5. When positioning to the faulty chip, the chip needs to be re-soldered. The method is to add a flux around the chip (preferably no-clean flux), and heat the solder joints of the chip pins to a dissolved status, so as to promote the chip pins and the pads to re-run, then removing tin finally, thus achieving the effect of re-tinning. If the failure is the same after re-soldering, the chip shall be replaced directly.
6. The repaired hash board can be determined to be a good product if it passes the fixture tests for more than twice. For the first time, after the replacement of the accessories is complete, wait for the hash board to cool down and perform fixture test, after passing, set it aside and then cool it down; for the second time, wait for a few minutes until the hash board is completely cooled before testing.
7. After the board is repaired, relevant maintenance / analysis records (requirements for maintenance reports: date, SN, PCB version, tag number, bad cause, bad liability attribution, etc.) should be prepared for feedback to production, after-sales, research and development departments.
8. After the record is prepared, install the entire miner for conventional aging.