Data sampling method and device, electronic equipment and readable storage medium

文档序号:7595 发布日期:2021-09-17 浏览:45次 中文

1. A method for data spot check, the method comprising:

acquiring data, and processing the data to obtain a random seed, wherein the random seed is obtained by a password hash function value;

processing the random seeds to generate sequence information;

and extracting the sequence information to obtain a data sampling inspection result.

2. The method of claim 1, wherein processing the data to obtain a random seed comprises:

and calculating the data by using a password hash function to obtain a password hash function value.

3. The method of claim 1, wherein processing the random seed to generate sequence information comprises:

and inputting the random seeds into a computer programming language, and operating the computer programming language to obtain a random sequence, wherein the random sequence is obtained by the sequence information.

4. The method of claim 1, wherein obtaining the data spot check result comprises:

and the first user checks and accepts the data sampling inspection result to obtain a checking and accepting result, the checking and accepting result passes, the first user feeds the checking and accepting result back to the second user, and the second user checks and accepts the data sampling inspection result again and can pass the checking and accepting result once.

5. The method of claim 4, wherein the obtaining of the data spot check result further comprises:

and if the acceptance result does not pass, the first user feeds back the acceptance result to the second user.

6. The method of claim 1, wherein the obtaining the data spot check result comprises:

and if the first user and the second user are in a credible cooperation state, the first user firstly checks the data sampling inspection result to obtain a checking result, and the first user feeds the checking result back to the second user.

7. The method of claim 6, wherein the obtaining of the data spot check result further comprises:

if the first user and the second user are not in a trusted cooperative state, providing additional brand new data for each delivery of the first user and the second user, and attaching the additional brand new data to a delivery receipt, wherein the additional brand new data comprises but is not limited to a name, time and signature image of a deliverer; and the delivery order is a proof that the second user confirms to receive the data accepted by the first user for the data sampling inspection result.

8. An apparatus for data spot check, the apparatus comprising:

the receiving module is used for acquiring data and processing the data to obtain a random seed, wherein the random seed is obtained by a password hash function value;

the generating module is used for processing the random seeds to generate sequence information;

and the extraction module is used for extracting the sequence information to obtain a data sampling inspection result.

9. An electronic device is characterized by comprising a processor, a communication interface, a memory and a communication bus, wherein the processor and the communication interface are used for realizing mutual communication by the memory through the communication bus;

a memory for storing a computer program;

a processor for implementing the method of any one of claims 1 to 7 when executing a program stored in the memory.

10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the spot check method according to any one of claims 1 to 7.

Background

With the development of data acceptance technology, for example, when the manual translation result in the chinese language is accepted, both the first user and the second user may respectively perform multiple random extractions on the manual translation result in the chinese language, and select the extraction result that is most favorable for each user to accept, so that the problem that the quality of data cannot be accurately evaluated may occur.

In order to avoid the problem that the data quality cannot be accurately evaluated when the manual translation result of the small language is checked, in the prior art, random seeds are input into a computer programming language, codes are operated to obtain a random sequence, an extraction result is obtained by extraction in the random sequence, and the extraction result is checked. If the value of the random seed is not set, the system selects the value of the random seed according to time, and the random sequence generated each time is different due to time difference.

However, in the actual application process, the random seed in the prior art is not fixed, and the random sequence generated by running the code is not fixed, which may cause different extraction results in the random sequence for the first user and the second user, respectively, and may cause a problem of low delivery passing rate.

Disclosure of Invention

In view of the above, the present invention is proposed to provide a data spot check method, apparatus, electronic device and readable storage medium that overcome or at least partially solve the above problems.

According to a first aspect of the present invention, there is provided a method for data spot check, the method comprising:

acquiring data, and processing the data to obtain a random seed, wherein the random seed is obtained by a password hash function value;

processing the random seeds to generate sequence information;

and extracting the sequence information to obtain a data sampling inspection result.

Optionally, the processing the data to obtain the random seed includes:

and calculating the data by using a password hash function to obtain a password hash function value.

Optionally, processing the random seed, and generating sequence information includes:

and inputting the random seeds into a computer programming language, and operating the computer programming language to obtain a random sequence, wherein the random sequence is obtained by the sequence information.

Optionally, the obtaining of the data spot check result includes:

and the first user checks and accepts the data sampling inspection result to obtain a checking and accepting result, the checking and accepting result passes, the first user feeds the checking and accepting result back to the second user, and the second user checks and accepts the data sampling inspection result again and can pass the checking and accepting result once.

Optionally, the obtaining the data spot check result further includes:

and if the acceptance result does not pass, the first user feeds back the acceptance result to the second user.

Optionally, the obtaining of the data spot check result includes:

and if the first user and the second user are in a credible cooperation state, the first user firstly checks the data sampling inspection result to obtain a checking result, and the first user feeds the checking result back to the second user.

Optionally, the obtaining the data spot check result further includes:

if the first user and the second user are not in a trusted cooperative state, providing additional brand new data for each delivery of the first user and the second user, and attaching the additional brand new data to a delivery receipt, wherein the additional brand new data comprises but is not limited to a name, time and signature image of a deliverer; and the delivery order is a proof that the second user confirms to receive the data accepted by the first user for the data sampling inspection result.

According to a second aspect of the present invention, there is provided an apparatus for data spot check, the apparatus comprising:

the receiving module is used for acquiring the data and processing the data to obtain a random seed, wherein the random seed is obtained by a password hash function value;

the generating module is used for processing the random seeds to generate sequence information;

and the extraction module is used for extracting the sequence information to obtain a data sampling inspection result.

According to a third aspect of the present invention, there is provided an electronic device for data spot check, the electronic device comprising:

the system comprises a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory complete mutual communication through the communication bus;

a memory for storing a computer program;

a processor for implementing the steps of the method according to the first aspect when executing a program stored in the memory.

According to a fourth aspect of the present invention, there is provided a readable storage medium for data spot check, the readable storage medium comprising:

the computer readable storage medium has stored thereon a computer program which, when executed by a processor, performs the steps of the method according to the first aspect.

According to the data spot check method, the data spot check device, the electronic equipment and the readable storage medium provided by the embodiment of the invention, before both sides check, the first user and the second user confirm the data to be spot checked firstly, the first user checks the data spot check result obtained by data spot check to obtain the check result, the first user feeds the check result back to the second user, and the second user checks again, so that the interaction cost of both sides can be reduced, and the check efficiency can be improved. Calculating a password hash function value for the data to be checked by using the password hash function, wherein the random seed is obtained from the password hash function value, processing the random seed to generate sequence information, and extracting the sequence information to obtain a data sampling check result; the random seeds are fixed, the random sequence is fixed, the data sampling inspection result is also fixed, accurate evaluation on data quality is facilitated, and the delivery passing rate is improved.

The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.

Drawings

Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:

fig. 1 is a flowchart illustrating steps of a data extraction method according to an embodiment of the present invention;

fig. 2 is a block diagram of a data spot check device according to an embodiment of the present invention.

Detailed Description

Exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the invention are shown in the drawings, it should be understood that the invention can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.

The terms first, second and the like in the description and in the claims of the present application are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It will be appreciated that the data so used may be interchanged under appropriate circumstances such that embodiments of the application may be practiced in sequences other than those illustrated or described herein, and that the terms "first," "second," and the like are generally used herein in a generic sense and do not limit the number of terms, e.g., the first term can be one or more than one. In addition, "and/or" in the specification and claims means at least one of connected objects, a character "/" generally means that a preceding and succeeding related objects are in an "or" relationship.

The data spot check method, device, electronic device and readable storage medium provided in the embodiments of the present application are described in detail below with reference to the accompanying drawings through specific embodiments and application scenarios thereof.

A first embodiment of the present invention relates to a data extraction method, a flowchart of which is shown in fig. 1, and includes:

step 101, acquiring data, and processing the data to obtain a random seed, wherein the random seed is obtained by a password hash function value;

it should be noted that, in this embodiment of the present application, a party b provides data to a party a, the party a checks the data provided by the party b, and before performing the check, the two parties need to determine which data needs to be checked, perform operation on the data to be checked by using the cryptographic hash function to obtain the cryptographic hash function value, use the cryptographic hash function value as a random seed, and use the computer programming language to generate a random sequence.

In the embodiment of the present invention, the data may be text data, audio data, image data, including but not limited to these.

In the embodiment of the present invention, the cryptographic hash function may be a specific cryptographic hash function such as MD5, which is not limited in this application.

The operation model of the password hash function is as follows: h ═ h (m). Wherein M is data to be processed; h is the one-way hash function and H is the generated cryptographic hash function value. Wherein H has the following individual properties: given H and M, H is easily calculated; given H and H, M is difficult to calculate; given H, it is computationally infeasible to find two different M's, such that the calculated H is the same.

102, processing the random seeds to generate sequence information;

it should be noted that, in the embodiment of the present application, the random seed is input into a computer programming language, and the computer programming language is operated to obtain a random sequence, so that the random seed is fixed, and then the generated random sequence is also fixed.

In an actual using process, many computer programming languages for generating the random sequence from the random seed may be supported, and the above examples are only illustrative, and the present application is not limited thereto.

Step 103, extracting in the sequence information to obtain a data sampling inspection result;

it should be noted that, in the embodiment of the present application, assuming that 10 of 10000 data are extracted for acceptance, a random sequence with a length of 10000 is generated, and then the first 10 numbers are selected, so that the probability of each piece of data being extracted is equal. The N data are extracted in a certain manner, the manner of the extraction may be any manner, the above example is only an example, and the present application is not limited thereto.

In the embodiment of the application, the manual translation of a certain language is excessive due to the participation of translation personnel, the level is uneven, the training management communication cost is high, before the first party performs data spot check on the translation result of the second party language, the two parties need to confirm the translation result of the second party language in advance to check, then the second party performs data spot check, the obtained data spot check result is checked to obtain the check result, the check result is passed through, and the first party can pass through the check at one time.

And the acceptance result is not passed, and the second party feeds back the acceptance result to the first party.

In the embodiment of the application, the above operation is based on the mutual credible cooperation state of the first party and the second party.

When the two parties of the first party and the second party are not in a credible cooperation state, for example, the second party checks the data random inspection result, the checked result is not passed, the second party modifies and resubmits the data which is not passed, and the step of repeated data random inspection by the second party is a brand-new data random inspection result.

According to the embodiment of the invention, before the two parties check, the first user and the second user confirm the data to be checked, the first user checks the data sampling result obtained by data sampling to obtain the checking result, the first user feeds the checking result back to the second user, and the second user checks again, so that the interaction cost of the two parties can be reduced, and the checking efficiency can be improved. Calculating a password hash function value for the data to be checked by using the password hash function, wherein the random seed is obtained from the password hash function value, processing the random seed to generate sequence information, and extracting the sequence information to obtain a data sampling check result; the random seeds are fixed, the random sequence is fixed, the data sampling inspection result is also fixed, accurate evaluation on data quality is facilitated, and the delivery passing rate is improved.

A second embodiment of the present invention relates to a data spot check device, a flowchart of which is shown in fig. 2, including:

a receiving module 201, configured to acquire data and process the data to obtain a random seed, where the random seed is obtained by a cryptographic hash function value;

a generating module 202, configured to process the random seed to generate sequence information;

and the extraction module 203 is configured to extract in the sequence information to obtain a data sampling result.

According to the embodiment of the invention, the data to be checked is confirmed before the two parties check, so that the interaction cost of the two parties can be reduced, and the checking efficiency can be improved. And calculating a password hash function value for the data to be checked by using the password hash function, wherein the random seed is obtained by the password hash function value, so that the random seed is fixed, the random sequence is fixed, the data sampling inspection result is also fixed, the accurate evaluation of the data quality is facilitated, and the delivery passing rate is improved.

For the device embodiment, since it is basically similar to the method embodiment, the description is simple, and for the relevant points, refer to the partial description of the method embodiment.

Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.

Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the claims, any of the claimed embodiments may be used in any combination.

It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

完整详细技术资料下载
上一篇:石墨接头机器人自动装卡簧、装栓机
下一篇:一种融合关系分析方法及系统

网友询问留言

已有0条留言

还没有人留言评论。精彩留言会获得点赞!

精彩留言,会给你点赞!